Tue Aug 16, 2022 10:58 pm
Login Register Lost Password? Contact Us

Please Note: The HPCC Systems forums are moving to Stack Overflow. We invite you to post your questions on Stack Overflow utilizing the tag hpcc-ecl (https://stackoverflow.com/search?tab=newest&q=hpcc-ecl). This legacy forum will be active and monitored during our transition to Stack Overflow but will become read only beginning September 1, 2022.

Random forest implementation

Topics related to the set of Machine Learning libraries and Matrix processing algorithms

Thu Apr 14, 2016 4:09 pm Change Time Zone


This question may be better suited to someone in technology, but I was wondering about the feasibility of doing the following. We are working on a project in Analytics that seems very computationally demanding. Is this something we could potentially do in production or is this just too much?

Here are the parameters:
- Each person will have about 100 data points that will need to be refreshed periodically, every month.
- We need to fit one random forest model per person.
- There are 50,000+ users and thus 50,000+ models.
- We need to score future data points based on the tree model for each user.
- We need to both train the model in production and also score the model in production.

Thank you.
Posts: 5
Joined: Fri Dec 11, 2015 3:36 pm

Return to Machine Learning

Who is online

Users browsing this forum: No registered users and 1 guest