Atlas

Data Lake. Unlock your data’s full potential.

Analyze your MongoDB application data and AWS S3 data with ease.
Try Free
Contact sales
Atlas architecture diagram highlighting Time Series within the Unified Query API category.
Forget about complex data integrations and operational overhead. Data Lake operates as a serverless, scalable query engine, delivering a simpler and faster experience when you’re working with data.
  • Quickly access all your data, wherever it resides
  • Run powerful aggregations for complex analysis
Illustration of documents caught on fishing hooks.

Analyze rich data

Preserving the rich structure of your data is invaluable. Now you can directly query Atlas databases and AWS S3 together using a single API. Run powerful and easy-to-understand aggregations so you always have a consistent experience, no matter what data type you’re using.
Simplified illustration of the data lake data stores in the Atlas product.

Transform and enrich data

Reduce your time and effort spent building the aggregations that transform and enrich your data. Data Lake reduces the effort, time-sink and complexity of pipelines and ETL tools when working with data in different formats - so you can generate the insights to power real-time applications.
Illustration of documents floating on waves.

Data on-demand at scale

Save time and money when you scale your cloud data lake. Don’t worry about managing infrastructure or predicting capacity. Only pay for what you use, and deliver the performance you need by parallelizing queries for global data lake analytics.
Illustration of a gear and a power cord connecting to an outlet.

Fully integrated with MongoDB Atlas

Spin up your cloud data lake alongside your operational Atlas databases with just a few clicks. Take advantage of our other product offerings including Compass and Charts to explore, visualize and share your data insights.
Learn more about MongoDB Atlas

Feature overview
general_features_multiple_formats
Multiple formats
Analyze data stored in JSON, BSON, CSV, TSV, Avro, ORC and Parquet in place without the complexity, cost, and time-sink of data ingestion and transformation.
mdb_aggregation_pipelines
Powerful aggregations
Run powerful, modular aggregations on data in-place and persist the results to your preferred storage tier for more control over your dataflows.
mdb_query
Federated query
Run a single query to analyze data across multiple MongoDB databases and AWS S3 together and in-place for faster insights.
atlas_serverless
Serverless
No infrastructure to set up and manage - create your cloud data lake with a few clicks and start running queries immediately.
general_features_on_demand
On demand
You only pay for the queries run and only when actively working with your data. With an on-demand service, you can eliminate the need to predict demand or capacity.
atlas_integration
Fully Integrated with MongoDB Atlas
Get access to our other product offerings such as Charts for advanced data visualization and Compass for a visual exploration of your data.

Deploy a Data Lake on MongoDB

We built Data Lake to simplify how you work with rich data. Spend more time uncovering insights instead of managing infrastructure.
View Documentation
Configure a Data Lake
Data Lake combines data from your MongoDB Atlas clusters and AWS S3 in virtual databases and collections. Your data remains in-place and in its native format.
Analyze and enrich data
Leverage MongoDB’s aggregation pipeline to combine, transform and enrich your data. Get insights quickly with federated, parallelized queries.
Persist query results
Send query results directly to an Atlas cluster or S3 bucket in your specified file format. Store data in your preferred storage tier without time-consuming ETL processes.
Configure a Data Lake
Data Lake combines data from your MongoDB Atlas clusters and AWS S3 in virtual databases and collections. Your data remains in-place and in its native format.
MQL
Analyze and enrich data
Leverage MongoDB’s aggregation pipeline to combine, transform and enrich your data. Get insights quickly with federated, parallelized queries.
MQL
Persist query results
Send query results directly to an Atlas cluster or S3 bucket in your specified file format. Store data in your preferred storage tier without time-consuming ETL processes.
MQL
MQL

Learn more about Atlas Data Lake

Discover how to analyze rich data easily and intuitively with a scalable cloud data lake.
Illustration of a chart.
Atlas Data Lake in action
See how you can combine and transform real-time application data and cloud data without complex integrations for faster insights.
Learn More
Illustration of a hand writing with a pencil on a document.
How-To
Federate queries across data sources
See step-by-step how you can federate queries across multiple data sources and easily persist results to your preferred storage.
Learn more

Get the most out of Atlas

Power more data-driven experiences and insights with the rest of our application data platform.
atlas_database
Database
Start with the multi-cloud database service built for resilience, scale, and the highest levels of data privacy and security.
Learn more

Get started with Data Lake today

Create a Data Lake alongside your operational Atlas database in a few clicks. Configure your Data Lake from multiple data sources, or use a sample dataset to get started today.
Try Free
Contact sales
GET STARTED WITH:
  • A unified data platform
  • Powerful aggregations
  • Sample datasets
  • Native tools and drivers
  • Multiple data formats
  • Pay-as-you go model