What is the difference between traditional RDBMS and Hadoop?


RDBMSHadoop
Data TypesRDBMS relies on the structured data and the schema of the data is always known.Any kind of data can be stored into Hadoop i.e. Be it structured, unstructured or semi-structured.
ProcessingRDBMS provides limited or no processing capabilities.Hadoop allows us to process the data which is distributed across the cluster in a parallel fashion.
Schema on Read Vs. WriteRDBMS is based on ‘schema on write’ where schema validation is done before loading the data.On the contrary, Hadoop follows the schema on read policy.
Read/Write SpeedIn RDBMS, reads are fast because the schema of the data is already known.The writes are fast in HDFS because no schema validation happens during HDFS write.
CostLicensed software, therefore, I have to pay for the software.Hadoop is an open source framework. So, I don’t need to pay for the software.
Best Fit Use CaseRDBMS is used for OLTP (Online Trasanctional Processing) system.Hadoop is used for Data discovery, data analytics or OLAP system.


Comments

Popular posts from this blog

JobTracker and TaskTracker

What is Cross Validation?