Skip to main content

End-to-end big data in a massively scalable supercomputing platform.

Open-source. Easy to use. Proven.

Try Now
Download

Join Us for Community Day - Oct 8 & 9 in Atlanta

2018 HPCC Systems Community Day

Join us at our gathering of engineers, data scientists, academia, and technology professionals who share knowledge, featured use cases / research projects, and future roadmap plans for the HPCC Systems platform.

View Additional Details

Be a part of one of the most anticipated activities of the HPCC Systems Community Day and submit your poster presentation!

Excellent prizes available for winners!

Find the answers you need

HPCC Systems enables business of all sizes to better analyze and understand data at scale, improving time to results and decisions.

It’s easier

Easier to update.

Easier to learn.

Easier to program.

Easier to integrate data.

Easier to manage clusters.

What's New?

New Release Available:
HPCC Systems 7.0.0 Beta 2 Gold

Download the Beta for access to the most significant new features we intend to include in the final gold version of this major release. Please remember that this beta contains fixes and new features that may require changes to the way you work.

New Blog:
Realtime Data Updates in ROXIE

Read the blog to learn just one way to (thoroughly) handle the scenario of live ROXIE data updates.

Highlights include:

  • File Locks
  • How your straightforward method will be different outside the development environment
  • The role of superkeys
  • How ROXIE integrates package map information during its file resolution actions
New Community Webcast:
The Download: HPCC Systems Community Tech Talks, Episode 16

The recording is now available for another episode of the community workshops to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.

This month we featured some of our very talented HPCC Systems summer interns:

  • Shah Muhammad Hamdi, PhD student studying Computer Science (Data Mining) at Georgia State University
  • Robert Kennedy, recently graduated Masters student in Computer Science at Florida Atlantic University
  • Aramis Tanelus, High School student studying at American Heritage School of Boca/Delray, Florida
  • Saminda Wijeratne, Masters student studying Computational Science and Engineering at Georgia Institute of Technology, Atlanta

Modern IDE support for data programming

Try playing with the code on our virtual playground and sample dataset.

Download the ECL IDE

Visit the ECL Playground

Get the Early Access VS Code ECL Extension

The HPCC Systems Platform

ETL Engine (Thor)

Extract, Transform, and Load your data using a powerful scripting language (ECL) specifically developed to work with data.

Query Engine (ROXIE)

An index based search engine to perform real-time queries. SOAP, XML, REST, and SQL are all supported interfaces.

Data Management Tools

Data Profiling, Data Cleansing, Snapshot Data Updates and consolidation, Job Scheduling and automation are some of the key features.

Predictive Modeling Tools

In place (supporting distributed linear algebra) predictive modeling functionality to perform Linear Regression, Logistic Regression, Decision Trees, and Random Forests.

Check out our blog posts

  • Arjuna Chala
    1 week 5 days ago
    I recently presented at the Open Data Science Conference (OSDC) in Boston. During my presentation, I addressed how a well-designed data lake solution can help solve fundamental data integration problems, and I used a real-world example involving analysis of a New York city taxi company’s route and fare data (which was stored in various formats depending on what was being tracked specifically) to illustrate how data analysis could improve outcomes for cab drivers, cab companies and the local government.
  • Richard Chapman
    1 week 5 days ago
    A Bloom filter, named after its inventor Burton Howard Bloom, is a data structure that can be used to perform a cheap test for the potential presence of a particular value, in a way that is much faster than looking up the value in an index, requiring much less storage than the index would. Note the “potential” there. The Bloom filter can tell you for certain if a value is not present, but it cannot say for certain that a value is present, only that it may be present.
  • Dan Camper
    1 month ago
    As new ECL programmers, we've all been there.  Using the HPCC Systems big data technology, you've successfully imported a bunch of data, analyzed it using ECL, created aggregated datasets, built an index around the aggregations and written ROXIE code to deliver query results on those aggregations in sub-second time. But then...

Browse all blog entries.

Follow us on Twitter

What people are saying about HPCC Systems

Take our survey. We plant a tree.

The best time to plant a tree was 20 years ago. The second best time is now.

Plant Now