You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There is something funny that itched in my head when it comes to "feature engineering": realizing that a lot of the times it feels like banging rocks together if Exploratory Data Analysis (EDA) is too hard or annoying to deal with, AutoML seemed to be a simple solution, but it usually takes too long. Here are some examples:
The field of Symbolic Regression seems to help this task. Originally, it was created for generating better equations, instead of using neural networks for predictions. Elegance is key, but it can also be used for tabular data.
Some of the libraries include (assuming we exclude quants):
There are like 3 different approaches to the solution (and paobably all of them need to get benchmarked by SRBench https://github.com/cavalab/srbench):
LLM-based solutions (most popular now, least explored)
Gather tabular dataset from OpenML and see if Symbolic Regression can be turned into a feature engineering tool that enhance model prediction, while minimizing computational complexity and time cost (assuming that AutoML is also an option)
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
There is something funny that itched in my head when it comes to "feature engineering": realizing that a lot of the times it feels like banging rocks together if Exploratory Data Analysis (EDA) is too hard or annoying to deal with, AutoML seemed to be a simple solution, but it usually takes too long. Here are some examples:
The field of Symbolic Regression seems to help this task. Originally, it was created for generating better equations, instead of using neural networks for predictions. Elegance is key, but it can also be used for tabular data.
Some of the libraries include (assuming we exclude quants):
There are like 3 different approaches to the solution (and paobably all of them need to get benchmarked by SRBench https://github.com/cavalab/srbench):
The goal of OpenEvolve in this case is
Beta Was this translation helpful? Give feedback.
All reactions