14th April 2021, Wednesday
Comparison: Apache Flink vs. Dremio
 Drawbacks: Such data lakes can turn into data swamps when files and folders follow no naming conventions, leaving duplicate data spread everywhere.
 Advantages: Data scientists need not spend time on ETL; the ETL effort drops to zero.
 Website: https://www.dremio.com
 Brief description: No moving data to proprietary data warehouses, no cubes, no aggregation tables or extracts. Just flexibility and control for data architects, and self-service for data consumers.
 Operating Systems Supported: Linux (Windows is not supported)
 Key Differentiator: One of the most popular data science platforms on the market
 Database Model: Columnar data model. Values of the same column share a data type and are stored together, so scans over a column make good use of the CPU cache.
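
 The columnar idea can be illustrated with a minimal Python sketch. It is not Dremio's actual implementation: the sample records, the column names (user_id, age, spend), and the to_columns helper are all hypothetical, chosen only to contrast a row layout with a column layout in which each column is one typed, contiguous buffer.

```python
from array import array

# Hypothetical sample records (not from the source): (user_id, age, spend)
rows = [
    (1, 34, 120.5),
    (2, 27, 80.0),
    (3, 45, 310.25),
]


def to_columns(records):
    """Pivot row tuples into one typed, contiguous array per column."""
    ids, ages, spend = zip(*records)
    return {
        "user_id": array("q", ids),  # signed 64-bit integers
        "age": array("q", ages),     # signed 64-bit integers
        "spend": array("d", spend),  # 64-bit floats
    }


columns = to_columns(rows)

# Row layout: every record is visited even though only one field is needed.
total_spend_rows = sum(record[2] for record in rows)

# Column layout: the aggregate scans a single buffer of same-typed values.
total_spend_cols = sum(columns["spend"])
```

 Both totals are equal; the difference lies in how much unrelated data the scan has to step over. In the column layout, an aggregate over spend reads only the spend buffer, which is the access pattern that benefits from the CPU cache.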