Seems like a smart way to conduct this study, but the implications are scary. Maybe platforms should automatically take steps that help anonymize their users.
I'm not sure if or how they selected for people actually trying to be anonymous, versus someone like me who explicitly wants the connections to be easy to make and links everything together all over the place.
Curious whether I'm in the dataset. Is there a way to find out?
Also, there is an old HN post, pre-LLM, about a tool that worked just on HN data. You submit some text and it gives you the most likely HN users with confidence scores.
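For anyone curious what that kind of matching can look like, here is a minimal sketch. To be clear, this is not how that tool worked (I don't know its internals); it's just one classic pre-LLM stylometry approach, character n-gram profiles compared by cosine similarity. The names `ngram_profile`, `cosine`, and `rank_authors` are made up for illustration, and the scores are raw similarities, not calibrated confidence values.

```python
from collections import Counter
from math import sqrt


def ngram_profile(text, n=3):
    """Count overlapping character n-grams in lowercased, whitespace-normalized text."""
    text = " ".join(text.lower().split())
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count profiles (0.0 if either is empty)."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_authors(sample, corpus, n=3):
    """Rank known authors by similarity to the sample, most similar first.

    `corpus` is a hypothetical mapping of username -> all of that user's text.
    """
    probe = ngram_profile(sample, n)
    scores = {user: cosine(probe, ngram_profile(text, n)) for user, text in corpus.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

You would call it as `rank_authors(some_text, {"user_a": "...", "user_b": "..."})` and read the top few entries. A real system would need far more text per user and better features, but even this crude version shows why writing style works as a fingerprint.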
We have a lot of "fingerprints", our writing being one. Interestingly, AI may actually be a way to anonymize your writing.