Skip to main content

Fork me on GitHub

Why use HPCC Systems?

Because it’s better at bigger data. Our comprehensive, dedicated data lake platform makes combining different types of data easier and faster than competing platforms — even data stored in massive, mixed schema data lakes — and it scales very quickly as your data needs grow. It’s also open source, free to use, and easy to learn. You can acquire, enrich, deliver and curate information faster using HPCC Systems — and save time and money, now and in the future.

The HPCC Systems advantage
  • Open source data lake platform
  • Batch, real-time and streaming data ingestion
  • Built-in data enhancement and Machine Learning APIs
  • Cloud Native capability
  • Scalable to many petabytes of data
  • Runs on commodity hardware and in the cloud
  • Increased responsiveness to customers and stakeholders

Save the Date: October 10-13, 2022

This free, virtual event returns to offer plenary and breakout sessions covering a wide variety of topics, a high quality virtual workshop, as well as presentations and technical posters from students working on HPCC Systems related projects.

Discover HPCC Systems, an end-to-end data lake management solution

HPCC Systems is a mature platform that has been heavily used in commercial applications for almost two decades, predating the development of Hadoop. Created by LexisNexis Risk Solutions, an innovative pioneer in big data processing, and open source for nearly a decade now, HPCC Systems features a vibrant development community that continues to push the boundaries of big data.

This powerful, versatile platform makes it easier for developers to see the data they’re working with and manipulate it as needed. Flexible information delivery makes it easier for your clients to query and find the data they need — and it runs analysis and queries faster than other platforms such as SQL or Hadoop.

Cloud Native: HPCC Systems 8.0.0

HPCC Systems 8.0.0 combines the usability of our bare metal platform with the automation of Kubernetes to make it easy to set up, manage, and scale your big data and data lake environments.

Free. Fast. Open source.

The HPCC Systems stack consists of a full suite of components that cater to every aspect of your data workflow. Click to watch.

HPCC Systems: The End-to-End Data Lake Management Solution

Case studies

Organizations have used HPCC Systems in demanding production environments for more than a decade, making it the most proven solution of its type. Learn how innovators are using the HPCC Systems platform in these detailed case studies.

case study image
case study logo

Using Machine Learning to enhance financial data

Learn More
case study image
case study logo

Increasing customer engagement with AI

Learn More
case study image
case study logo

Using Big Data to help feed the world

Learn More
case study image
case study logo

Managing worker safety in real time

Learn More

Why these organizations use HPCC Systems

testimonial image

Integrating Multiple Data Sources

Versatile. Fast. Forgiving. Tyler shares how Reed Exhibitions utilizes HPCC Systems to integrate multiple data sources together to bring insights and data visualization to his customers in seconds.

Tyler talks about the support, ease of implementation, analytics, plugins and the dynamics of the HPCC Systems platform.

testimonial image

Creating a Single View of Data

Powerful. Agile. Simple. Mathew and Charlotte discuss how XPert HR uses HPCC Systems to enable data-driven decision making for their HR customers by allowing them to bring all of their employee data into a single view.

HPCC Systems allows XPert HR’s developers to spend less time connecting systems and more time developing features.

testimonial image

Increasing Business Value

Proven. Capable. Flexible. Fujio describes how HPCC Systems can deliver significant value by eliminating the need for multiple systems, decreasing time to market, and providing increased flexibility.


Complete capabilities for every aspect of your workflow

Our tools make managing your data easy.

Tombolo catalogs all the data assets in your data lake, including relationships.

ECL Cloud IDE makes learning ECL easy by providing rich integration between data and code without needing to install client software.

Our powerful data engines execute automatically in a highly performing, parallel work stream.

Thor, our data refinery engine, let’s you take control of data transformation. Thor can easily profile, clean, enhance, transform, and analyze mixed-schema data.

ROXIE is an index-based search engine that performs real-time queries through a variety of interfaces including SOAP, XML, REST, and SQL.

Support for tools such as Couchbase, MySQL, Kafka, and MariaDB enable real-time data ingestion for live stream IoT workloads.

The platform also leverages Tensorflow to perform CPU-based Neural Networks Learning, giving you the simplicity of ECL and the power of Tensorflow.

Built in libraries include scripts for information extraction, profiling, cleaning, normalization, and analytics.

Our Machine Learning library works efficiently in a parallel distributed environment, so you can execute Machine Learning algorithms without moving data to a new platform.

Seamless integration makes it easy to deliver the flexibility your clients need.

Our platform offers robust connectivity options and integrates with a number of third-party solutions to make data lake management as easy and seamless as possible.

ECL provides a fast, powerful coding experience from ingestion to information delivery.

ECL is an easy-to-learn, advanced, and flexible declarative language that was initially developed for complex data scenarios more than 20 years ago and has been tested and refined continuously ever since.

Ready to dive in?

HPCC Systems is free and open source, so you can test and implement it without making a big investment. Visit our Get Started page to explore the power of HPCC Systems. Whether you want to try out our ECL playground or download the full program, we've made it easy for you to get started using HPCC Systems in less than an hour.

To continue reading about the platform, checkout the About page or the HPCC Systems User Guide. If you’re looking for more technical instruction, visit our Training page to find online or in person classes.