LLMs can be helpful (Ie. for example, entity resolution for data cleaning) but the core models you have to use to actually make the predictions (this looks a lot more like "old" tabular data approaches).
This seems like a fun(i mean enjoyable) domain.
Also, again, if i may ask, what is your field of study? Is it related to finance or statistics?
Here's a nice summary of some of these ideas: https://differentialprivacy.org/synth-data-1/.
The main commercial opensource language for serious Statisticians is R. You can Google for the sorts of jobs requiring R as a marker, if you're interested in applications of Statistics unrelated to LLMs.
To answer your own question about classical ML, you can Google for jobs requiring the specific classical ML technologies in which you are interested as a marker.