SANTA CLARA, Calif., Feb. 26, 2013 /PRNewswire/ -- Strata Conference – Big Data software and tools provider Revelytix today announced it has successfully completed a Hadoop-based pilot for a top tier U.S. Healthcare company.
The pilot sought to establish the scalability and flexibility of Hadoop in supporting the development of new analytic models. Once data sets were loaded into Hadoop, the approach "Start small, Scale quickly" was used. Beginning with samples of the data sets, queries were iteratively developed and tested. Users wrote queries in standard query languages (e.g. SQL) which the Hadoop-integrated query engine compiled to MapReduce. With each cycle the queries became progressively complex and were then deployed across the full data sets.
The pilot also sought to examine the data from new perspectives and required substantial data integration to create views which seamlessly span multiple data sets. The healthcare company also wanted to create transparency of data with industry partners.
The Revelytix approach was to first create a domain model that described the data sets using a desired semantic set of terms. The data was then mapped to that model and the Hadoop-integrated query engine was used to pull results from all data sets, thereby providing a complete picture – in this case, of consumer behavior.
CEO Mike Lang commented, "There are two important things we learned in this Hadoop Healthcare pilot. First, Hadoop is extraordinarily well suited to build the analytic environment for mega-scale healthcare data such as this. Without question, Hadoop will definitely play a big role in big data for healthcare.
"Second, our experience again confirmed the need for the Revelytix Loom solution for managing the huge complexity of Hadoop datasets such as this. The power of Hadoop spawns files from many perspectives and analytic approaches – keeping track of the data history of each approach is a significant and ongoing challenge. Having the pre-release of the Loom data management software and workbench allowed us to keep this pilot on track. We're looking forward to going further with this work in healthcare as well as other vertical markets such as financial services, telecommunications, life sciences and oil and gas."
Revelytix provides Loom Dataset Management for Hadoop which makes managing complex datasets easier for enterprises and their data scientists. Loom Dataset Management for Hadoop is the product of the Revelytix team's 14 years of big data experience for large enterprises and the Department of Defense. Loom increases the productivity of all users by adding dataset management capability to the Hadoop information processing platform.
Revelytix www.revelytix.com 53 Loveton Circle Sparks, Maryland 21152
+1 917 690 1999 Mobile
+1 443 212 5073 Office