Configuring the Storage
A production installation of lakeFS will usually use your cloud provider’s object storage as the underlying storage layer. You can choose to create a new bucket/container (recommended), or use an existing one with a path prefix. The path under the existing bucket/container should be empty.
After you have a bucket/container configured, proceed to Installing lakeFS.
Choose your cloud provider to configure your storage.
Table of contents
AWS S3
- From the S3 Administration console, choose
Create Bucket
. - Make sure you:
- Block public access
- Disable Object Locking
-
Go to the
Permissions
tab, and create a Bucket Policy. Use the following structure:{ "Id": "Policy1590051531320", "Version": "2012-10-17", "Statement": [ { "Sid": "Stmt1590051522178", "Action": [ "s3:GetObject", "s3:GetObjectVersion", "s3:PutObject", "s3:AbortMultipartUpload", "s3:ListMultipartUploadParts", "s3:GetBucketVersioning", "s3:ListBucket", "s3:GetBucketLocation", "s3:ListBucketMultipartUploads", "s3:ListBucketVersions" ], "Effect": "Allow", "Resource": ["arn:aws:s3:::<BUCKET_NAME>", "arn:aws:s3:::<BUCKET_NAME_WITH_PATH_PREFIX>/*"], "Principal": { "AWS": ["arn:aws:iam::<ACCOUNT_ID>:role/<IAM_ROLE>"] } } ] }
Replace
<ACCOUNT_ID>
,<BUCKET_NAME>
and<IAM_ROLE>
with values relevant to your environment.IAM_ROLE
should be the role assumed by your lakeFS installation.Alternatively, if you use an AWS user’s key-pair to authenticate lakeFS to AWS, change the policy’s Principal to be the user:
"Principal": { "AWS": ["arn:aws:iam::<ACCOUNT_ID>:user/<IAM_USER>"] }
You can now proceed to Installing lakeFS.
Microsoft Azure Blob Storage
Create a container in Azure portal
- From the Azure portal, Storage Accounts, choose your account, then in the container tab click
+ Container
. - Make sure you block public access
Authenticate with Secret Key
In case you want to use the secret key for authentication you will need to use the account key in the configuration Go to the Access Keys
tab and click on Show Keys
save the values under Storage account name
and Key
we will need them in the installing lakeFS step
Authenticate with Active Directory
In case you want your lakeFS Installation (we will install in the next step) to access this Container using Active Directory authentication, First go to the container you created in step 1.
- Go to
Access Control (IAM)
- Go to the
Role assignments
tab - Add the
Storage Blob Data Contributor
role to the Installation running lakeFS.
You can now proceed to Installing lakeFS.
Google Cloud Storage
-
On the Google Cloud Storage console, click Create Bucket. Follow the instructions.
-
On the Permissions tab, add the service account you intend to use lakeFS with. Give it a role that allows reading and writing to the bucket, e.g. Storage Object Creator.
You can now proceed to Installing lakeFS.