OpenAI claims new GPT-5 model boosts ChatGPT to ‘PhD level’ (www.bbc.com)
Posted by sabreW4K3@lazysoci.al to Technology@beehaw.org · English · 3 months ago · 33 comments · cross-posted to: technology@lemmy.world
shnizmuffin@lemmy.inbutts.lol · English · 3 months ago
If I asked a PhD, “How many Bs are there in the word ‘blueberry’?” They’d call an ambulance for my obvious, severe concussion. They wouldn’t answer, “There are three Bs in the word blueberry! I know, it’s super tricky!”

panda_abyss@lemmy.ca · 3 months ago (edited)
I don’t feel this is a good example of why LLMs shouldn’t be treated like PhDs.
My first interactions with gpt5 have been pretty awful, and I’d test it but it’s not available to me anymore
Edit: I am not having a stroke, I’m bad at typing and autocorrect hates me

shnizmuffin@lemmy.inbutts.lol · English · 3 months ago
Do you smell toast?

[author not shown]
BlackBerry toast

darreninthenet@piefed.social · English · 3 months ago
FWIW, ChatGPT 5 gets this correct

[author not shown]
Fuckin’ does it?

limerod@reddthat.com · 3 months ago
You appear to be using the older gpt model. The newer model calculates and answers correctly for most words at least for the few I asked

mbtrhcs@feddit.org · 3 months ago
It literally says 5 in the screenshot but ok