Introduction to Big Data

The paradigm has shifted in information world where increasingly deep and focused business insights are true competitive differentiators. Enterprises are under constant pressure to address business problems with “fit-for-purpose” approach than the traditionally broad approach of “one-fit-for-all”. Enterprises both private & public are required to discover moreabout their customers extending now to a level of detailed understanding of facts, relationships, indicators, patterns, trends, and pointers that were not previously available with traditional databases and processes. This makes it necessary tocapture and store more data and utilize it cost effectively to create increased value. In turn this requires dealing with new levels of data volume, data variety, and data which flows with greater velocity. Big Data is a phenomenon that has captured our attention with the advent of open source technologies (like Hadoop / Map Reduce / Cassandra etc.) to do Massive Parallel Processing using low cost commodity servers offering a new paradigm in data storage and query and offering to completely unseat the economics of RDMS data storage at all levels with new very low cost alternatives.

Big Data is a new art and science, using Massive Parallel Processing (MPP) technology, of collection, storage, processing, distribution, and analysis of data with any of the attributes – high volume, high velocity, high variety to extract high value and greater accuracy (veracity) actionable information.