End-to-end big data in a massively scalable supercomputing platform.

Open-source. Easy to use. Proven.

Try Now
Download

Find the answers you need

HPCC Systems enables businesses of all sizes to better analyze and understand data at scale, improving time to results and decisions.

Easier

Easier to update.

Easier to learn.

Easier to program.

Easier to integrate data.

Easier to manage clusters.

What's New?

New Release Available:
HPCC Systems 7.2.0 Gold

Download now to take advantage of new features and improvements in ECL IDE, Java Embed, Spark, and more! View the release notes for the full list of changes. Supporting documentation is available on our website, and we recommend browsing the RedBook for additional information about specific items. Learn more in the Forum Announcement.

New Blog:
ECL Tips – All about the ECL SET

Read the blog to learn about the ECL SET, a useful tool in query design and processing. Highlights include ECL SET basics, types, and other features. The post examines ECL functions and expressions that incorporate ECL SET, provides examples and best practices, and explains how ECL SETs and DATASETs interact.
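One common use of a set in a query filter is to replace a chain of "OR" comparisons with a single membership test. The sketch below shows that idea in Python rather than ECL; it is an analogy only, and the records, field names, and state codes are made up for illustration.

```python
# Python analogy for set-based filtering (not ECL code).
# The records and field names below are hypothetical sample data.
records = [
    {"name": "Ann", "state": "FL"},
    {"name": "Bob", "state": "NY"},
    {"name": "Cy", "state": "GA"},
    {"name": "Di", "state": "TX"},
]


def filter_with_or(rows):
    """Filter using a chain of OR comparisons."""
    return [
        r for r in rows
        if r["state"] == "FL" or r["state"] == "GA" or r["state"] == "AL"
    ]


# The same condition expressed once, as a set of accepted values.
TARGET_STATES = {"FL", "GA", "AL"}


def filter_with_set(rows):
    """Filter using a single set-membership test."""
    return [r for r in rows if r["state"] in TARGET_STATES]


# Both filters select the same records.
matching_names = [r["name"] for r in filter_with_set(records)]
print(matching_names)
```

Keeping the accepted values in one named set means the filter condition does not grow as values are added, and the same set can be reused by other filters.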

New 5 Questions Interview:
Jo Prichard

Listen to this edition of “5 Questions,” where Flavio interviews Jo Prichard, senior data scientist at LexisNexis Risk Solutions. Hear Jo’s perspective on how his work helps LexisNexis Risk Solutions prevent or mitigate fraud, the advantages HPCC Systems provides compared to other open-source platforms, and his experience as a mentor for the HPCC Systems internship program.

Modern IDE support for data programming

Try out the code in our virtual playground using a sample dataset.

Download the ECL IDE

Visit the ECL Playground

Get the VS Code ECL Extension

The HPCC Systems Platform

ETL Engine (Thor)

Extract, Transform, and Load your data using a powerful scripting language (ECL) specifically developed to work with data.

Query Engine (ROXIE)

An index-based search engine for real-time queries. SOAP, XML, REST, and SQL interfaces are all supported.

Data Management Tools

Key features include data profiling, data cleansing, snapshot data updates and consolidation, and job scheduling and automation.

Machine Learning Tools

In-place predictive modeling functionality, with support for distributed linear algebra, to perform Linear Regression, Logistic Regression, Decision Trees, and Random Forests.

Check out our blog

  • Cassandra Walker
    5 days ago
    This ECL Tip spotlights the Enterprise Control Language (ECL) SET and the functions that incorporate SET. ECL SET is a useful tool in query design and processing. One of the most common applications for ECL SET is to simplify code by eliminating multiple “OR” conditions in a filter.
  • Jessica Lorti
    1 week 6 days ago
    Jo Prichard is a senior data scientist at LexisNexis Risk Solutions and a long-time HPCC Systems user. Focusing on big data research and development, Jo helps enterprises target fraud, collusion and other socio-behavioral risk factors.
  • Roger Dev
    1 month ago
    Text Vectorization allows for the mathematical treatment of textual information. Words, phrases, sentences, and paragraphs can be organized as points in high-dimensional space such that closeness in space implies closeness of meaning. HPCC Systems' new TextVectors module supports vectorization for words, phrases, or sentences in a parallelized, high-performance, and user-friendly package.
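The idea in the last entry above, that closeness in space implies closeness of meaning, can be sketched with cosine similarity. This is a minimal Python illustration, not the TextVectors module or its API; the three-dimensional vectors are invented toy values, whereas real text vectors are learned from data and have many more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical toy vectors, chosen by hand for illustration only.
toy_vectors = {
    "cat": [0.9, 0.8, 0.1],
    "kitten": [0.85, 0.75, 0.2],
    "truck": [0.1, 0.2, 0.95],
}


def most_similar(word, vectors):
    """Return the other word whose vector is closest to `word`."""
    target = vectors[word]
    scored = [
        (other, cosine_similarity(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    return max(scored, key=lambda pair: pair[1])[0]


nearest = most_similar("cat", toy_vectors)
print(nearest)
```

With these toy values, "cat" lies much closer to "kitten" than to "truck", which is the property a vectorization method aims to produce for related words.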

Browse all blog entries.

Take our survey. We plant a tree.

The best time to plant trees is 20 years ago. Now is the second best time.

Plant Now