DB-Tools.com - System Comparison
4th December 2020, Friday
 

Editorial information provided by DB-Tools
Comparison | Hadoop Cloudera | Hadoop Hortonworks
Version | 5.9.x | HDP v3.0
Name | Cloudera | Hortonworks
Drawbacks | -- | --
Advantages | -- | --
Languages Supported | Java, Python | Java, Python
Website | www.cloudera.com | www.hortonworks.com
XML Support | No | No
JSON Support | Yes | Yes
Brief Description | Cloudera is a hybrid open-source packaged distribution, primarily of Apache Hadoop, Spark, and Kafka. The distribution is called CDH (Cloudera Distribution Including Apache Hadoop). It is targeted at enterprise-class deployments of the Hadoop platform. | Hortonworks has three interoperable product lines: HDP (based on Apache Hadoop, Apache Hive, and Apache Spark), HDF (based on Apache NiFi, Apache Storm, and Apache Kafka), and Data Plane Services (based on Apache Atlas and Cloudbreak).
Database Model | Hadoop Distributed File System (HDFS) | Hadoop Distributed File System (HDFS)
Technical Documentation | https://www.cloudera.com/documentation.html | https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/index.html
License | Commercial | Commercial
Cloud-based / SaaS | Altus is the cloud offering of Cloudera | Not available
Implementation Language | NA | NA
Operating System Supported | Linux | Windows, Linux
Options for Integration / Access API | RESTful HTTP | RESTful HTTP
Consistency | NA | NA
Foreign Keys | No, but two files can be joined using Hive and Impala | No, but two files can be joined using Hive and Impala
Streaming Support | Yes | Yes
Analytics Support | Using MLlib in Apache Spark | Using MLlib in Apache Spark
Data Storage Schema | Hadoop Distributed File System (HDFS) | Hadoop Distributed File System (HDFS)
Notable Users | Dun & Bradstreet, AOL | hotels.com, hilton.com
Key Differentiator | Cloudera Navigator provides lineage of the various jobs and data points. | More suited to Windows-based platforms. Azure has a big partnership with Hortonworks.
Concurrency | Yes | Yes
Partitioning | Yes | Yes
Replication | Yes | Yes
Secondary Indexes | Yes, in HBase. A data warehouse using Cloudera is generally built on HBase, which has secondary indexes. | Yes, in HBase. A data warehouse using Cloudera is generally built on HBase, which has secondary indexes.
Schemaless | Yes | Yes
SQL Query | No. HiveQL, which is similar to SQL, can be used with Hive. | No. HiveQL, which is similar to SQL, can be used with Hive.
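
Both distributions list RESTful HTTP as their integration option. As a rough illustration only, the sketch below builds a URL for WebHDFS, the REST interface that Apache Hadoop provides for HDFS. The helper name webhdfs_url, the host name, and the port are assumptions made for this example and are not taken from the comparison above. No request is sent; the code only shows the shape of the URL.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, path, op, params=None):
    """Build a WebHDFS REST URL for an HDFS path (hypothetical helper).

    host and port identify the NameNode HTTP endpoint; both are
    placeholders here and depend on the cluster configuration.
    """
    # "op" selects the file system operation, e.g. LISTSTATUS or OPEN.
    query = {"op": op}
    if params:
        query.update(params)
    # The HDFS path is appended after the fixed /webhdfs/v1 prefix.
    return f"http://{host}:{port}/webhdfs/v1{quote(path)}?{urlencode(query)}"


# Example: URL that would list the contents of /data/events.
print(webhdfs_url("namenode.example.com", 9870, "/data/events", "LISTSTATUS"))
```

The same URL pattern should apply to either distribution, since both expose HDFS; authentication and the actual port vary by cluster setup.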