What is Databases?
A database is a storage unit for structured data. They structure your data for organization, protection, and delivery of data. You can think of a database as a library. Libraries contain a lot of information stored as books, and they are systematically organized for retrieval.
There are several types of database types with the exception of Hadoop, which is a software framework that supports distributed storage of data:
Relational databases are the most common type of database. They store data in tabular relations such as rows and columns that make up a table similar to a spreadsheet. They include popular database management systems such as Oracle Database, MySQL and SQL Server. Almost all relational databases rely on structured query language (SQL) for managing data in database systems.
NoSQL is a modern database aimed at supporting large sets of distributed data. The development of NoSQL began as a response to big data, which calls for highly scalable, distributed architecture and flexible data requirements. The name “NoSQL” is intended to showcase that NoSQL database solutions do not only operate in SQL languages or interfaces.
Apache’s Hadoop is an open source framework written in Java for distributed storage and processing of large datasets (also known as big data) spread across many clustered commodity hardware such as servers. It can handle large volumes of structured, semi-structured, and unstructured data such as images, audio or video files. Hadoop is not a database and is far from structured so it does not use query languages like SQL, which relational databases use. Hadoop is deployed as software through many companies such as Cloudera that often provide a SQL-like language for ease of use.