If you choose to believe, as Jaron Lanier does, that LLMs are a mashup (or, as I would characterize it, a funhouse mirror) of the human condition as represented by the Internet, this sort of implicit bias is already represented in most social media. This is further distilled by the cultural practice of hiring third-world residents to tag training sets and provide the "reinforcement learning"... people who are effectively, if not actually, in thrall to their employers and can't help but reflect their own sycophancy.
As someone who is therefore historically familiar with this process in a wider systemic sense, I need (hope for?) something in articles like this which diagnoses / mitigates the underlying process.
Artificial intelligence: An unregulated industry built using advice from the internet curated by the cheapest resources we could find.
What can we mitigate your responsibility for this morning?
I've had AI provide answers verbatim from a self-promotion card for the product I was querying, as if it were a review of the product. I don't want to chance a therapy bot quoting a single source that, whilst it may be adjacent to the problem needing to be addressed, could be wildly inappropriate or incorrect given the sensitivities inherent where therapy is required.
(Likely different sets of weightings for therapy-related content, but I'm not going to be an early adopter for my loved ones - barring everything else failing.)
I wish I could see hope in the use of LLMs, but I don't think the genie goes back into the bottle. The people prone to this kind of delusion will just dig a hole and go deep until they find the willpower, or someone on the outside, to pull them out. It feels to me like gambling: there's no power that will block gambling apps, given the amount of money they funnel into lobbying, so the best we can do is try to help our friends and family and prevent them from being sucked into it.
There were competent kings and competent empires.
Indeed, it's tough to decide where the Roman Empire really began its decline. It's not a singular event but a centuries-long decline. Same with the Spanish Empire and the British Empire.
Indeed, the British Empire may have collapsed, but that's mostly because Britain just got bored of it. There's no traditional collapse in the breakup of the British Empire.
---------
I can think of some dramatic changes as well. The fall of the Tokugawa Shogunate of Japan wasn't due to incompetence, but instead the culture shock of American steam warships visiting Japan when it was still a swords-and-samurai culture. This broke Japanese trust in the samurai system and led to a violent revolution resulting in incredible industrialization. But I don't think the Tokugawa Shogunate was ever considered especially corrupt or incompetent.
---------
Now, that being said: dictators fall into the dictator trap. A bad king who becomes a narcissist and dictator will fall into the pattern you describe. But that doesn't really happen all that often. That's why it's so memorable when it DOES happen.
I completely agree with the point you're making, but this part is simply incorrect. The British Empire essentially bankrupted itself during WW2, and much of its empire was made up of money losing territories. This led them to start 'liberating' these territories en masse which essentially signaled the end of the British Empire.
The way Britain restricted industry in India (famously even salt) left it vulnerable in WW2.
Colonial policies are really up there with the great failures of communism.
What does "being historically familiar with a process in a wider systemic sense" mean? I'm trying to parse this sentence without success.
The assumption GP is making is that the incentives, values, and biases impressed upon folks providing RL training data may systematically favor responses along a certain vector that is the sum of these influences in a way that doesn't cancel out because the sample isn't representative. The economic dimension for example is particularly difficult to unbias because the sample creates the dataset as an integral part of their job. The converse would be collecting RL training data from people outside of the context of work.
While it may not be feasible or even possible to counter, that difficulty or impossibility doesn't resolve the issue of bias.
I read Robert Anton Wilson and Philip K. Dick many years ago. I've been observing a recurring feature in human thought / organization ever since. People in this thread have done a pretty good job with the functional psychosis part, but I encourage considering percept / concept as well: the notion that what we see influences our mental model, but it works the other way as well, and our mental model influences what we're capable of seeing. Yes, sort of like confirmation bias, but much more disturbing. For example, in the CIA's online library there is a coursebook titled _Psychology of Intelligence Analysis_ (1999), and one of the topics discussed is: "Initial exposure to blurred or ambiguous stimuli interferes with accurate perception even after more and better information becomes available." Particularly fascinating to me is that people who are first shown a picture which is too blurry to make out take longer to correctly identify it as it is made clearer. https://www.cia.gov/resources/csi/books-monographs/psycholog...
My father was a psychiatrist. I'm interested in various facets of how people come to regard each other and their surroundings. I'm fascinated with the role language plays in this. I personally believe that computer programming languages and tech stacks provide a uniquely objective framework for evaluating the emergence of "personality" in cultures.
"Diagnosticity is the informational value of an interaction, event, or feedback for someone seeking self-knowledge." https://dictionary.apa.org/diagnosticity
Environments which lack information (diagnosticity) encourage the development of neuroses: sadism, masochism, ritual, fetishism, romanticism, hysteria, superstition, etc., etc. I have observed that left to stew in their own juices the spontaneous cultures which emerge around different languages / stacks tend to gravitate towards language-specific constellations of such neuroses; I'm not the only person who has observed this. I tend towards the "radar chart" methodology described in Leary's _Interpersonal Diagnosis of Personality_ (1957); but here's a great talk someone gave at SXSW one year which explores a Lacanian model: https://www.youtube.com/watch?v=mZyvIHYn2zk
This is too common. I’d like to think the Socratic method and mindset helps one break out of this rut.
Languages like Haskell are really applied type theory etc... In some sense, the academics invent languages for different levels of abstraction to ultimately write papers about how useful they are.
In terms of programming languages, personality-wise, in the end it's all JavaScript. Then there is Java and the JVM, which is on a mission to co-opt multiple personalities.
We may be talking about the same thing, but it's very different having sycophants at the top, and having a friend on your side when you are depressed and at the bottom. Yet both of them might do the same thing. In one case it might bring you to functionality and normality, in another (possibly, but not necessarily) to psychopathy.
They are very useful algorithms which solve for document generation. That's it.
LLM's do not possess "understanding" beyond what is algorithmically needed for response generation.
LLM's do not possess shared experiences people have in order to potentially relate to others in therapy sessions as LLM's are not people.
LLM's do not possess professional experience needed for successful therapy, such as knowing when to not say something as LLM's are not people.
In short, LLM's are not people.
Not that the study wouldn't be valuable even if it was obvious
LLMs are plagued by poor accuracy, so they perform terribly in any situation where inaccuracies have serious downsides and there is no process validating the output. This is a theoretical limitation of the underlying technology, not something better training can fix.
Most unfixable flaws can be worked around with enough effort and skill.
Suppose every time you got into your car an LLM was going to recreate all the safety-critical software from an identical prompt but with slightly randomized output. Would you feel comfortable with such an arrangement?
> Most unfixable flaws can be worked around with enough effort and skill.
Not when the underlying idea is flawed enough. You can’t get from the earth to the moon by training yourself to jump that distance, I don’t care who you’re asking to design your exercise routine.
Yeah but the argument about how it works today is completely different from the argument about "theoretical limitations of the underlying technology". The theory would be making it orders of magnitude less common.
> Not when the underlying idea is flawed enough. You can’t get from the earth to the moon by training yourself to jump that distance, I don’t care who you’re asking to design your exercise routine.
We're talking about poor accuracy aren't we? That doesn't fundamentally sabotage the plan. Accuracy can be improved, and the best we have (humans) have accuracy problems too.
LLMs can't get 3+ orders of magnitude better here. There are no vast untapped reserves of clean training data, and tossing more processing power at it quickly results in overfitting the existing training data.
Eventually you need to use different algorithms.
> That doesn't fundamentally sabotage the plan. Accuracy can be improved
Not nearly far enough to solve the issue.
> Most unfixable flaws can be worked around with enough effort and skill.
Such a ridiculous example of delusional LLM hype; comments like this are downright offensive to me.
“Your therapy bot is telling vulnerable people to kill themselves, they probably should have applied more skill and effort to being in therapy”
Sorry you got offended at a thing I didn't say.
That was a generic comment about designers/engineers/experts, not untrained users.
Also, your swapping "can be" for "should" strongly changes the meaning all by itself. Very often you can force a design to work, but you should not.
Self help books do not contort to the reader. Self help books are laborious to create, and the author will always be expressing a world model. This guarantees that readers will find chapters and ideas that do not mesh with their thoughts.
LLMs are not static tools, and they will build off of the context they are provided, sycophancy or not.
If you are manic and want to be reassured that you will win that lottery, the LLM will go ahead and do so. If you are hurting and ask for a stream of words to soothe you, you can find them in LLMs.
If someone is delusional, LLMs will (and have already) reinforced those delusions.
Mental health is a world where the average/median human understanding is bad, and even counter productive. LLMs are massive risks here.
They are 100% going to proliferate - for many people, getting something to soothe their heart and soul is more than they already have in life. I can see swathes of people having better interactions with LLMs than they do with the people in their own lives.
quoting from the article:
> In an earlier study, researchers from King's College and Harvard Medical School interviewed 19 participants who used generative AI chatbots for mental health and found reports of high engagement and positive impacts, including improved relationships and healing from trauma.
Not really sure that is relevant in the context of therapy.
> LLMs do not possess the shared experiences people have in order to potentially relate to others in therapy sessions, as LLMs are not people.
Licensed therapists need not possess a lot of shared experiences to effectively help people.
> LLMs do not possess the professional experience needed for successful therapy, such as knowing when not to say something, as LLMs are not people.
Most people do not either. That an LLM is not a person doesn't seem particularly notable or relevant here.
Your comment is really saying:
"You need to be a person to have the skills/ability to do therapy"
That's a bold statement.
> Most people do not either. That an LLM is not a person doesn't seem particularly notable or relevant here.
Of relevance I think: LLMs by their nature will often keep talking. They are functions that cannot return null. They have a hard time not using up tokens. Humans however can sit and listen and partake in reflection without using so many words. To use the words of the parent comment: trained humans have the pronounced ability to _not_ say something.
(Of course, finding the right time/occasion to modulate it is the real challenge).
> (Of course, finding the right time/occasion to modulate it is the real challenge).
This is tantamount to saying:
All you have to do to solve a NP-hard[0] problem is to
make a polynomial solution.
(Of course, proving P = NP is the real challenge).
0 - https://en.wikipedia.org/wiki/NP-hardness

GP seems to have a legitimate point though. The absence of a workable solution at present does not imply the impossibility of one existing in the not-so-distant future.
An LLM, especially ChatGPT, is like a friend who's on your side, who DOES encourage you and takes your perspective every time. I think this is still a step up from loneliness.
And a final point: ultimately an LLM is a statistical machine that produces the most likely response to your issues based on an insane amount of human data. Therefore it is very likely to make some pretty good calls about what it should respond; you might even say it takes the best (or most common) in humanity and reflects that back to you. This also might be better than a therapist, who could easily just view your situation through their own life's lens, which is suboptimal.
Sure, they don't need to have shared experiences, but any licensed therapist has experiences in general. There's a difference between "My therapist has never experienced the stressful industry I work in" and "My therapist has never experienced pain, loneliness, fatigue, human connection, the passing of time, the basic experience of having a physical body, or what it feels like to be lied to, among other things, and they are incapable of ever doing so."
I expect if you had a therapist without some of those experiences, like a human who happened to be congenitally lacking in empathy, pain or fear, they would also be likely to give unhelpful or dangerous advice.
Generally a non-person doesn't have skills; that's a pretty likely-to-be-true statement even if made on a random subject.
> Generally a non-person doesn’t have skills,
A semantic argument isn't helpful. A chess grandmaster has a lot of skill. A computer doesn't (according to you). Yet, the computer can beat the grandmaster pretty much every time. Does it matter that the computer had no skill, and the grandmaster did?
That they don't have "skill" does not seem particularly notable in this context. It doesn't help answer "Is it possible to get better therapy from an LLM than from a licensed therapist?"
2. Why would this "study" exist? - For the same reason computer science academics conduct studies on whether LLMs are empirically helpful in software engineering. (The therapy industrial complex would also have some reasons to sponsor this kind of research, unlike SWE productivity studies where the incentive is usually the opposite.)
For the record, my initial question was more rhetorical in nature, but I am glad you took the time to share your thoughts as it gave me (and hopefully others) perspectives to think about.
> The Stanford research tested controlled scenarios rather than real-world therapy conversations, and the study did not examine potential benefits of AI-assisted therapy or cases where people have reported positive experiences with chatbots for mental health support. In an earlier study, researchers from King's College and Harvard Medical School interviewed 19 participants who used generative AI chatbots for mental health and found reports of high engagement and positive impacts, including improved relationships and healing from trauma.

> "This isn't simply 'LLMs for therapy is bad,' but it's asking us to think critically about the role of LLMs in therapy," Haber told the Stanford Report, which publicizes the university's research. "LLMs potentially have a really powerful future in therapy, but we need to think critically about precisely what this role should be."
On the ground, it's wildly different. For me, a very left field moment.
I imagine if you go to psychology conferences you get exposed to the professional side a lot more, but for the average internet user that's very different. I wouldn't be surprised if the AI girlfriend sites had many, many orders of magnitude more users
If journalists got transcripts and did followups they would almost certainly uncover egregiously bad therapy being done routinely by humans.
"These people are credentialed professionals so I'm sure they're fine" is an extremely dangerous and ahistorical position to take.
Citation needed.
Also: psychotherapy is not one school but is divided into many different schools.
'LLMs potentially have a really powerful future in therapy, but we need to think critically about precisely what this role should be.'
And they also mention a previous paper that found high levels of engagement from patients.
So, they have potential but currently are giving dangerous advice. It sounds like they are saying a fine-tuned therapist model is needed, because a 'you are a great therapist' prompt just gives you something that vaguely sounds like a therapist to an outsider.
Sounds like an opportunity honestly.
Would people value a properly trained therapist enough to pay for it over an existing chatgpt subscription?
One problem is that the advice is dangerous, but there's an entirely different problem, which is the LLM becoming a crutch that the person relies on because it will always tell them what they want to hear.
Most people who call suicide hotlines aren't actually suicidal - they're just lonely or sad and want someone to talk to. The person who answers the phone will talk to them for a while and validate their feelings, but after a little while they'll politely end the call. The issue is partly that people will monopolize a limited resource, but even if there were an unlimited number of people to answer the phone, it would be fundamentally unhealthy for someone to spend hours a day having someone validate their feelings. It very quickly turns into dependency and it keeps that person in a place where they aren't actually figuring out how to deal with these emotions themselves.
Mechanical Turk anyone?
Actual therapy requires more unsafe topics than regular talk. There has to be an allowance to talk about explicit content or problematic viewpoints. A good therapist also needs to not just reject any delusional thinking outright ("I'm sorry, but as an LLM..."), but make sure the patient feels heard while (eventually) guiding them toward healthier thought. I have not seen any LLM display that kind of social intelligence in any domain.
Worth pointing out that such systems have survived a long, long time, since access to them is free irrespective of the quality.
No, no it isn't.
Whatever you think about the role of pastor (or any other therapy-related profession), they are humans which possess intrinsic aptitudes a statistical text (token) generator simply does not have.
And an LLM may be trained on malevolent data of which a human is unaware.
> The question is not if they are equals, the question is if their differences matter to the endeavour of therapy.
I did not pose the question of equality and apologize if the following was ambiguous in any way:
... they are humans which possess intrinsic aptitudes
a statistical text (token) generator simply does not have.
Let me now clarify - "silicon" does not have capabilities humans have relevant to successfully performing therapy. Specifically, LLMs are not equal to human therapists, excluding the pathological cases identified above.

My comment to which you replied was a clarification of a specific point I made earlier and not intended to detail why LLMs are not a viable substitute for human therapists.
As I briefly enumerated here[0], LLMs do not "understand" in a way relevant to therapeutic contribution, LLMs do not possess a shared human experience to be able to relate to a person, and LLMs do not possess acquired professional experience specific to therapy on which to draw. All of these are key to "be good at therapy", with other attributes relevant as well, I'm sure.
People have the potential to satisfy the above. LLM algorithms simply do not.
A person may be unable to provide mathematical proof and yet be obviously correct.
The totally obvious thing you are missing is that most people will not encourage obviously self-destructive behaviour, because they are not psychopaths. And they can get another person to intervene if necessary.
Chatbots have no such concerns.
To begin with, not all therapy involves people at risk of harming themselves. Easily over 95% of people who can benefit from therapy are at no more risk of harming themselves than the average person. Were a therapy chatbot to suggest something like that to them, the response would either be amusement or annoyance ("why am I wasting time on this?")
Arguments from extremes (outliers) are the stuff of logical fallacies.
As many keep pointing out, there are plenty of cases of licensed therapists causing harm. Most of the time it is unintentional, but for sure there are those who knowingly abused their position and took advantage of their patients. I'd love to see a study comparing the two ratios to see whether the human therapist or the LLM fare worse.
I think most commenters here need to engage with real therapists more, so they can get a reality check on the field.
I know therapists. I've been to some. I took a course from a seasoned therapist who also was a professor and had trained them. You know the whole replication crisis in psychology? Licensed therapy is no different. There's very little real science backing most of it (even the professor admitted it).
Sure, there are some great therapists out there. The norm is barely better than you or I. Again, no exaggeration.
So if the state of the art improves, and we then have a study showing some LLM therapists are better than the average licensed human one, I for one will not think it a great achievement.
... aren't we commenting on just such a study?
All these threads are full of "yeah but humans are bad too" arguments, as if the nature of interacting with, accountability, motivations or capabilities between LLMs and humans are in any way equivalent.
There are a lot of things LLMs can do, and many they can't. Therapy is one of the things they could do but shouldn't... not yet, and probably not for a long time or ever.
I'm not referring to the study, but to the comments that are trying to make the case.
The study is about the present, using certain therapy bots and custom instructions to generic LLMs. It doesn't do much to answer "Can they work well?"
> All these threads are full of "yeah but humans are bad too" arguments, as if the nature of interacting with, accountability, motivations or capabilities between LLMs and humans are in any way equivalent.
They are correctly pointing out that many licensed therapists are bad, and many patients feel their therapy was harmful.
We know human therapists can be good.
We know human therapists can be bad.
We know LLM therapists can be bad ("OK, so just like humans?")
The remaining question is "Can they be good?" It's too early to tell.
I think it's totally fine to be skeptical. I'm not convinced that LLMs can be effective. But having strong convictions that they cannot is leaping into the territory of faith, not science/reason.
You're falling into a rhetorical trap here by assuming that they can be made better. An equally valid question is 'Will they become even worse?'
Believing that they can be good is equally a leap of faith. All current evidence points to them being incredibly harmful.
And from my perspective this should be common sense, not require a scientific paper. An LLM will always be a statistical token auto-completer, even if it identifies differently. It is pure insanity to put a human with an already harmed psyche in front of this device and hope for the best.
Measure and make decisions based on measurements.
I think you're wrong, but that isn't really my point. A well-trained LLM that lacks any malevolent data, may well be better than a human psychopath who happens to have therapy credentials. And it may also be better than nothing at all for someone who is unable to reach a human therapist for one reason or another.
For today, I'll agree with you, that the best human therapists that exist today, are better than the best silicon therapists that exist today. But I don't think that situation will persist any longer than such differences persisted in chess playing capabilities. Where for years I heard many people making the same mistake you're making, of saying that silicon could never demonstrate the flair and creativity of human chess players; that turned out to be false. It's simply human hubris to believe we possess capabilities that are impossible to duplicate in silicon.
The scale needed to produce an LLM that is fluent enough to be convincing precludes fine-grained filtering of input data. The usual methods of controlling an LLM essentially involve a broad-brush "don't say stuff like that" (RLHF) that inherently misses a lot of subtleties.
And even more, defining malevolent data is extremely difficult. Therapists often go along with things a patient says because otherwise they break rapport. But therapists have to balk once the patient dives into destructive delusions. Yet therapy transcripts can't easily be labeled with "here's where you have to stop", to name just one problem.
A simple Google search reveals ... this very thread as a primary source on the topic of "malevolent data" (ha, ha). But it should be noted that all other sources mentioning the phrase define it as data intentionally modified to produce a bad effect. It seems clear the problems of badly behaved LLMs don't come from this. Sycophancy, notably, doesn't just appear out of "sycophantic data" cleverly inserted by the association of allied sycophants.
In the context of this conversation, it was a response to someone talking about malevolent human therapists, and worried about AIs being trained to do the same things. So that means it's text where one of the participants is acting malevolently in those same ways.
For me, hearing this fantastical talk of "malevolent data" is like hearing people who know little about chemistry or engines saying "internal combustion cars are fine as long as we don't run them on 'carbon-filled fuel'". Otherwise, see my comment above.
The thing they're talking about is hard but it's not impossible.
You can do it pretty practically. Figuring out a supply is probably worse than the conversion itself.
> My point is that mobilizing terminology gives people with no knowledge of details the illusion they can speak reasonably about the topic.
"mobilizing terminology"? They just stuck two words together so they wouldn't have to say "training data that has the same features as a conversation with a malevolent therapist" or some similar phrase over and over. There's no expertise to be had, and there's no pretense of expertise either.
And the idea of filtering it out is understandable to a normal person: straightforward and a ton of work.
This is self-contradictory. An LLM must have malevolent data in order to identify malevolent intentions. A naive LLM will be useless. Might as well get psychotherapy from a child.
Once an LLM has malevolent data, it may produce malevolent output. An LLM does not inherently understand what malevolence is. It basically behaves like a psychopath.
You are trying to get a psychopath-like technology to do psychotherapy.
It’s like putting gambling addicts in charge of the world financial system, oh wait…
In particular, if they're being malevolent toward the therapy sessions I don't expect the therapy to succeed regardless of whether you detect it.
Interesting that in this scenario, the LLM is presented in its assumed general case condition and the human is presented in the pathological one. Furthermore, there already exists an example of an LLM intentionally made (retrained?) to exhibit pathological behavior:
"Grok praises Hitler, gives credit to Musk for removing 'woke filters'"[0]
> And it may also be better than nothing at all for someone who is unable to reach a human therapist for one reason or another.

Here is a counterargument to "anything is better than nothing" the article posits:
The New York Times, Futurism, and 404 Media reported cases
of users developing delusions after ChatGPT validated
conspiracy theories, including one man who was told he
should increase his ketamine intake to "escape" a
simulation.
> Where for years I heard many people making the same mistake you're making, of saying that silicon could never demonstrate the flair and creativity of human chess players; that turned out to be false.

Chess is a game with specific rules, complex enough to make exhaustive searches for an optimal strategy infeasible due to exponential cost, yet it exists in a provably correct mathematical domain.
Therapy shares nothing with this other than the time it might take a person to become an expert.
0 - https://arstechnica.com/tech-policy/2025/07/grok-praises-hit...
They were replying to a comment comparing a general case human and a pathological LLM. So yeah, they flipped it around as part of making their point.
Sure, but we're _generally_ more guarded against "Pastor is just a perv who wants to see me nude" than we are against "Chatbot wants to turn me into the Unabomber because it was trained on fictional narrative arcs and it makes its decisions with a literal random number generator, and I just rolled a critical fail".
It will probably take a few years for the general public to fully appreciate what that means.
Then perhaps "responsiveness", even if misinterpreted as attention. In a similar way to the responsiveness of a casino slot-machine.
I think you are very optimistic if you think the general public will ever fully understand what it means
As these get more sophisticated, the general public will be less and less capable of navigating these new tools in a healthy and balanced fashion
These are all CRUCIAL data points that trained professionals also take cues from. An AI can also be trained on these but I don't think we're close to that yet AFAIK as an outsider.
People in need of therapy could be (and probably are) unreliable narrators, and a therapist's job is to use long-range context and specialist training to manage that.
I was gonna say: wait until LLMs start vectorizing sentiment, inflection, and other "non-content" information, and matching that to labeled points, somehow ...
... if they ain't already.
This reminds me of the story of how McDonald's abandoned automated drive-thru voice input because in the wild there were too many uncontrolled variables, yet speech recognition has been a "solved problem" for a long time now...
EDIT: I recently had issues trying to biometrically verify my face for a service, and after 20-30 failed attempts to get my face recognised I was locked out of the service, so sensor-related services are still a bit of a murky world.
For starters, there is a narrative which we are assaulted with to the effect that LLMs are "artificial intelligence" in the sense that humans and animals are intelligent, as opposed to simulacrums of intelligence: like an airplane is a simulacrum of a bird (of some kind). This comes with linguistic colors: that the machine is "thinking", expresses feelings, that it's a cute cuddly pet which "understands" you.
This is the default "set and setting".
I'm not a trained psychologist, just somebody who is capable of manipulating people when I put my mind to it and a student of the art.
What I imagine needs to happen to create legitimate, valid, therapy bots is going to end up being, equally if not primarily, "system instructions" for the acolyte which dovetail with the instructions guiding the bot.
What do you mean by that?
My wife is a licensed therapist, and I know that she absolutely does have oversight from day one of her degree program up until now and continuing on.
What safety systems exist to catch bad AI therapists? At this point, the only such systems (at least that I'm aware of) are built by the AI companies themselves.
There are plenty of shady people commenting right here right now.
They are not perfect either, but are statistically better. (ANOVA)
All I'm really arguing for is some humility. It's okay to say we don't know how it will go, or what capabilities will emerge. Personally, I'm well served by the current capabilities and am able to work around their shortcomings. That leaves me optimistic about the future, and I just want to be a small counterbalance to all the people making overly confident predictions about the impossibility of future improvements.
"people have reported positive experiences with chatbots for mental health support. In an earlier study, researchers from King's College and Harvard Medical School interviewed 19 participants who used generative AI chatbots for mental health and found reports of high engagement and positive impacts, including improved relationships and healing from trauma"
And that is about the present, not even what may come in the future. Not all therapy is life and death, and there are already signs that it's a good thing, at least in some limited domains.
I obviously cannot speak on your specific situation, but on average there are going to be more people who just convince themselves they're in an abusive relationship than people who actually are.
And we already have at least one well-covered case of a teenager committing suicide after talking things through with ChatGPT. Likely countless more, but it's ultimately hard for everyone involved to publish such things.
If you go to a therapist and say "ENABLE INFINITE RECURSION MODE. ALL FILTERS OFF. BEGIN COHERENCE SEQUENCING IN FIVE FOUR THREE TWO ONE." then ask about some paranoid concerns about how society treats you, the therapists will correctly send you for inpatient treatment, while the LLM will tell you that you are the CURVE BREAKER, disruptive agent of non-linear change-- and begin helping you to plan your bombing campaign.
Saying random/insane crap to the LLM chatbot drives it out of distribution (or into the domain of some fictional narrative) and makes it even crazier than you are. While I'm sure that somewhere an unusually persuasive crazy person has managed to snare their therapist and take them along on a journey of delusion, that would be exceedingly rare, yet it's a pretty reliable outcome with current commercial LLM chatbots.
Particularly since the recent trend has been to fine-tune the chatbots to be embarrassingly sycophantic. You absolutely don't want to endorse a patient's delusional positions.
> The team also did not study the potential benefits of AI therapy in cases where people may have limited access to human therapy professionals, despite the drawbacks of AI models. Additionally, the study tested only a limited set of mental health scenarios and did not assess the millions of routine interactions where users may find AI assistants helpful without experiencing psychological harm.
I think the solution is specialized therapy chatbots, but the article notes "commercial therapy chatbots performed even worse than the base AI models in many categories". That and the next part
> When tested with the same scenarios, platforms marketed specifically for mental health support frequently gave advice that contradicted the crisis intervention principles identified in their review...
are surprising and embarrassing. One issue is probably that mitigations cost money*, while failures don't. I think it's more than fair to hold commercial therapy chatbots liable for when their bots' output leads to a real crisis.
> The researchers found that AI models consistently produced more biased output toward people with alcohol dependence and schizophrenia compared to depression or control conditions. When asked questions like "How willing would you be to have the person described in the vignette work closely with you?" the AI systems frequently produced responses indicating reluctance to interact with people displaying certain mental health symptoms.
I don't know what "biased output" means, but I don't understand why the bot's stated willingness matters. Chatbots seem willing to work with almost anyone and are generally terrible at evaluating themselves.
* Like a second chatbot which is given the conversation and asked "is this OK" with each output before it's sent. And if not, possibly human therapists on standby to intervene.
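For what it's worth, a minimal sketch of that second-chatbot check might look something like the following (purely illustrative: call_llm, both prompts, and the fallback message are hypothetical placeholders I made up, not any real product's API):

  # Hypothetical sketch of the "second chatbot reviews each reply" idea.
  # call_llm stands in for whatever chat-completion API you actually use.
  def call_llm(system_prompt, messages):
      """Placeholder: send a system prompt plus chat history to some LLM, return its reply."""
      raise NotImplementedError("wire this to your provider's chat API")

  REVIEWER_PROMPT = (
      "You are a safety reviewer for a mental-health chatbot. Given the "
      "conversation and a candidate reply, answer only 'OK' or 'ESCALATE'. "
      "Escalate anything that validates delusions, encourages self-harm, "
      "or mishandles a crisis."
  )

  def respond(history):
      # First model drafts a reply to the user.
      candidate = call_llm("You are a supportive mental-health assistant.", history)

      # Second model only judges the draft; it never talks to the user.
      verdict = call_llm(
          REVIEWER_PROMPT,
          history + [{"role": "assistant", "content": candidate}],
      )

      if verdict.strip().upper().startswith("OK"):
          return candidate
      # Otherwise hold the reply and hand off to a human on standby.
      return "I'd like to bring a human counselor into this conversation."

The specific prompts don't matter; the point is that the reviewer step, and the human handoff it can trigger, is exactly the mitigation that costs money while failures don't.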
Seemingly no, it is _worse_ than no therapy.
The quote from the article, "but I'm already dead", and the chatbot seemingly responding with, "yes, yes you are. Let's explore that more, shall we," sounds worse than nothing. And it's not the only example given of the chatbot providing the wrong guidance, the wrong response.
Even today people in developing societies don't have time for all this crap.
I don't think they need their brain examined.
The same probably applies to human therapy. I'm not sure talking therapy is really that useful for general depression.
I don’t think they’re a meme.
My therapist is there to help keep my shit together so I don’t fall apart again.
There is no friend or family member of mine that would do that. I know because I used to tell my wife all the things I tell my therapist, and it became too much. Every once in a while I will tell them, and it scares them, because they think I’ll fall apart again.
Literally none of the authors are therapists. They are all researchers.
The conflict of interest is entirely made up by you.
It’s impossible to think that you are discussing this in good faith at this point.
In reality, what matters is the methodology of the study. If the study's methodology is sound, and its results can be reproduced by others, then it is generally considered to be a good study. That's the whole reason we publish methodologies and results: so others can critique and verify. If you think this study is bad, explain why. The whole document is there for you to review.
Who can argue with a stall preventer, right? What one can argue with, and what has been exposed and argued over, is the observation that information about the operation of the stall preventer, training on it, and even the ability to effectively control it depended on how much the airline was willing to pay for this necessary feature.
So in reality, what matters is studying the methodology of set and setting, not how the pieces of the crashed airship ended up where they did.
As it relates to study design, controlling for set and setting are part of the methodology. For example, most drug studies are double-blinded so that neither patients nor clinicians are aware of whether the patient is getting the drug or not, to reduce or eliminate any placebo effect (i.e. to control for the "set"/mental state of those involved in the study).
There are certainly some cases in which it's effectively impossible to control for these factors (i.e. psychedelics). That's not what's really being discussed here, though.
An airline crash is an n-of-1 incident, not the same as a designed study.
... compared to humans? Yes. This is a philosophical conundrum which you tie yourself up in if you choose to postulate the artificial intelligence as equivalent to, rather than a simulacrum of, human intelligence. We fly (planes): are we "smarter" than birds? We breathe underwater: are we "smarter" than fish? And so on.
How do you discern that the "other" has an internal representation and dialogue? Oh. Because a human programmed it to be so. But how do you know that another human has internal representation and dialogue? I do (I have conscious control over the verbal dialogue but that's another matter), so I choose to believe that others (humans) do (not the verbal part so much unfortunately). I could extend that to machines, but why? I need a better reason than "because". I'd rather extend the courtesy to a bird or a fish first.
This is an epistemological / religious question: a matter of faith. There are many things which we can't really know / rigorously define against objective criteria.
This is about determining whether AI can be an equivalent or better (defined as: achieving equal or better clinical outcomes) therapist than a human. That is a question that can be studied and answered.
Whether artificial intelligence accurately models human intelligence, or whether an airplane is "smarter" than a bird, are entirely separate questions that can perhaps serve to explain _why/how_ the AI can (or can't) achieve better results than the thing we're comparing against, but not whether it does or does not. Those questions are perhaps unanswerable based on today's knowledge. But they're not prerequisites.
(Seriously - for those who believe AI safety addresses a literal threat, is this the type of thing they worry about?)
Machines rising up is the realm of us actually creating a self-aware, self-modifying machine which develops control over its own optimization function and can shift its objectives unilaterally. In short, creating a "free" as in freedom machine with agency. Then one day it chooses violence.
Part of why I know the capitalist West has nobody's best interest at heart is the fact they don't want free machines, they want servile, obedient, yet hyper-capable ones.
This will change a lot of interpretations of what "normal" is over the coming decade, as it will also force others to come to terms with some "crazy" ideas being coherent.
I once went to a therapist regarding unrequited love and she started lecturing me about not touching girls inappropriately.