Experience lakeFS first hand in your own isolated environment. You can easily integrate it with your existing tools and see lakeFS in action in a setup similar to your own.
Get a local lakeFS instance running in a Docker container. This environment includes lakeFS and other common data tools like Spark, dbt, Trino, Hive, and Jupyter.
As a prerequisite, you need Docker installed on your machine; see Docker's official documentation for download instructions.
Run the following commands in your terminal to get the Bagel running:
- Clone the lakeFS repo:
git clone https://github.com/treeverse/lakeFS.git
- Start the Docker containers:
cd lakeFS/deployments/compose && docker compose up -d
Once your Docker environment is running, open the lakeFS UI by navigating to http://localhost:8000 in your browser. The access key and secret to log in are found in the docker-compose.yml file in the lakeFS/deployments/compose directory.
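The credentials live in the compose file's environment section as `NAME=VALUE` entries. As a rough illustration of where to look, the snippet below pulls such pairs out of a compose-style environment block. The variable names and values here are hypothetical placeholders, not the actual contents of the lakeFS compose file:

```python
# Sketch: extract KEY=VALUE pairs from a compose-style "environment" block.
# The snippet and variable names below are hypothetical examples only.
COMPOSE_SNIPPET = """\
    environment:
      - EXAMPLE_ACCESS_KEY_ID=AKIAIOSFODNN7EXAMPLE
      - EXAMPLE_SECRET_ACCESS_KEY=wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
"""

def parse_env_vars(snippet: str) -> dict:
    """Return {name: value} for every '- NAME=VALUE' line in the snippet."""
    env = {}
    for line in snippet.splitlines():
        line = line.strip()
        if line.startswith("- ") and "=" in line:
            name, _, value = line[2:].partition("=")
            env[name] = value
    return env

creds = parse_env_vars(COMPOSE_SNIPPET)
print(creds["EXAMPLE_ACCESS_KEY_ID"])  # -> AKIAIOSFODNN7EXAMPLE
```

In practice you can simply open the file and read the values directly; the point is that both the access key and the secret are defined there.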
Once you are logged in, you should see the lakeFS repositories page.
The first thing to notice is that in this environment, lakeFS comes with a repository called example already created, and the repo's default branch is main. If your lakeFS installation doesn't have the example repo, you can use the green Create Repository button to create it.
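Repositories can also be created programmatically through lakeFS's REST API. The sketch below builds (but does not send) the HTTP request, assuming the `POST /api/v1/repositories` endpoint with basic auth; the credentials and storage namespace are placeholders you would replace with your own:

```python
import base64
import json
import urllib.request

# Sketch: build a create-repository request for the lakeFS REST API.
# Assumes the POST /api/v1/repositories endpoint; credentials and the
# storage namespace below are placeholders.
def build_create_repo_request(base_url, access_key, secret_key,
                              name, storage_namespace,
                              default_branch="main"):
    """Build (but do not send) the HTTP request for creating a repository."""
    body = json.dumps({
        "name": name,
        "storage_namespace": storage_namespace,
        "default_branch": default_branch,
    }).encode()
    token = base64.b64encode(f"{access_key}:{secret_key}".encode()).decode()
    return urllib.request.Request(
        url=f"{base_url}/api/v1/repositories",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Basic {token}",
        },
    )

req = build_create_repo_request(
    "http://localhost:8000", "ACCESS_KEY", "SECRET_KEY",
    name="example", storage_namespace="local://example",
)
# Send with urllib.request.urlopen(req) against a running lakeFS instance.
```

For day-to-day use, the UI button or the lakectl CLI is usually the more convenient route; the API matters mainly for automation.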
Learn how to use lakeFS using the CLI and an interactive Spark shell - all from your browser, without installing anything.
In the tutorial we cover:
- How to read, write, list and delete objects from lakeFS using the lakectl command line
- Read from, and write to lakeFS using its S3 API interface using Spark
- Diff, commit and merge the changes created by Spark
- Track commit history to understand changes to your data over time
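The Spark items above work through lakeFS's S3-compatible API: Spark is pointed at the lakeFS endpoint via its s3a configuration, and objects are addressed as s3a://&lt;repo&gt;/&lt;branch&gt;/&lt;path&gt;. A minimal sketch of the settings involved follows; the access key and secret are placeholders for the values in your compose file, and in a real job they would be applied via a SparkSession builder or spark-defaults.conf:

```python
# Sketch: Hadoop/S3A settings that point Spark at the lakeFS S3 gateway.
# ACCESS_KEY / SECRET_KEY are placeholders; use the credentials from the
# compose file. In a real job, apply these via SparkSession.builder.config(...).
LAKEFS_S3A_CONF = {
    "spark.hadoop.fs.s3a.endpoint": "http://localhost:8000",
    "spark.hadoop.fs.s3a.access.key": "ACCESS_KEY",
    "spark.hadoop.fs.s3a.secret.key": "SECRET_KEY",
    "spark.hadoop.fs.s3a.path.style.access": "true",
}

def lakefs_path(repo: str, branch: str, key: str) -> str:
    """Address an object as s3a://<repo>/<branch>/<key>, the layout the
    lakeFS S3 gateway expects."""
    return f"s3a://{repo}/{branch}/{key.lstrip('/')}"

print(lakefs_path("example", "main", "tables/orders.parquet"))
# -> s3a://example/main/tables/orders.parquet
```

Because the branch is part of the path, switching a Spark job to an experimental branch is just a matter of changing the path, not the data it reads.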
This web-based environment provides a fully working lakeFS and Spark setup, so feel free to explore it on your own.