DB-Tools.com - System Comparison
29th January 2022, Saturday

Editorial information provided by DB-Tools
Comparison: Azure Data Lake vs. Hadoop Hortonworks
Name | Azure Data Lake | Hortonworks
Version | NA | HDP v3.0
Drawbacks | NA | --
Advantages | Develop massively parallel programs with simplicity. Debug and optimize your big data programs with ease. Enterprise-grade security and auditing. Start in seconds, scale instantly, pay per job. Built on YARN and designed for the Microsoft cloud. | --
Languages Supported | Java, Python, Scala | Java, Python
Website | aws.amazon.com/rds/aurora/ | www.hortonworks.com
XML Support | No | No
JSON Support | Yes | Yes
Brief Description | Azure Data Lake stores and analyzes petabyte-size files and trillions of objects. | Hortonworks has three interoperable product lines: HDP (based on Apache Hadoop, Apache Hive, and Apache Spark), HDF (based on Apache NiFi, Apache Storm, and Apache Kafka), and Data Plane Services (based on Apache Atlas and Cloudbreak).
Database Model | Relational Database | Hadoop Distributed File System (HDFS)
Technical Documentation | https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/Aurora.Overview.html | https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/index.html
License | Commercial | Commercial
Cloud-based / SaaS | SaaS service from Microsoft Azure | Not available
Implementation Language | NA | NA
Operating System Supported | Not applicable, as it is managed by Azure | Linux
Options for Integration / Access API | RESTful HTTP | RESTful HTTP
Consistency | NA | NA
Foreign Keys | NA | No, but you can join two files using Hive and Impala
Streaming Support | Yes | Yes
Analytics Support | NA | Using MLlib in Apache Spark
Data Storage Schema | NA | Hadoop Distributed File System (HDFS)
Notable Users | NA | hotels.com, hilton.com
Key Differentiator | Azure Data Lake is the most stable of all the out-of-the-box data lake solutions on the market as of Sep 2018. | More suited for platforms based on Windows. Azure has a big partnership with Hortonworks.
Concurrency | Yes | Yes
Partitioning | No | Yes
Replication | Yes | Yes
Secondary Indexes | Yes, based on secondary indexes using Solr | Yes, in HBase. Data warehouses on Cloudera are generally built using HBase, which has secondary indexes.
Schemaless | Yes | Yes
SQL Query | NA | No. HiveQL, which is similar to SQL, can be used with Hive.
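
The integration row above lists RESTful HTTP access for Hortonworks. As a rough illustration only, the sketch below builds a request URL in the style of WebHDFS, the REST interface that ships with Apache Hadoop. It assumes WebHDFS is the endpoint in question, which the comparison does not state. The host name, port, and HDFS path are hypothetical placeholders, and no request is actually sent.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, path, op, **params):
    """Build a WebHDFS-style REST URL for an absolute HDFS path.

    Only the URL string is constructed; nothing is sent over the network.
    """
    if not path.startswith("/"):
        raise ValueError("HDFS path must be absolute")
    # The operation (for example LISTSTATUS or OPEN) goes in the query string.
    query = urlencode({"op": op, **params})
    return f"http://{host}:{port}/webhdfs/v1{quote(path)}?{query}"


# Hypothetical host, port, and path, chosen for illustration only.
print(webhdfs_url("namenode.example.com", 9870, "/data/bookings", "LISTSTATUS"))
```

The resulting string could be passed to any HTTP client. Authentication, HTTPS, and gateway routing depend on how a given cluster is configured and are left out of this sketch.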