Skip to main content Link Menu Expand (external link) Document Search Copy Copied

Apache Hadoop

Hadoop is an open-source software framework for storing data and running applications on [cluster] of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs ([data-pipeline]).