sabreW4K3@lazysoci.al to Technology@beehaw.orgEnglish · 1 month agoOpenAI claims new GPT-5 model boosts ChatGPT to ‘PhD level’www.bbc.comexternal-linkmessage-square33linkfedilinkarrow-up145arrow-down16cross-posted to: technology@lemmy.world
arrow-up139arrow-down1external-linkOpenAI claims new GPT-5 model boosts ChatGPT to ‘PhD level’www.bbc.comsabreW4K3@lazysoci.al to Technology@beehaw.orgEnglish · 1 month agomessage-square33linkfedilinkcross-posted to: technology@lemmy.world
minus-squareshnizmuffin@lemmy.inbutts.lollinkfedilinkEnglisharrow-up47·1 month agoIf I asked a PhD, “How many Bs are there in the word ‘blueberry’?” They’d call an ambulance for my obvious, severe concussion. They wouldn’t answer, “There are three Bs in the word blueberry! I know, it’s super tricky!”
minus-squarepanda_abyss@lemmy.calinkfedilinkarrow-up6·edit-21 month agoI don’t feel this is a good example of why LLMs shouldn’t be treated like PhDs. My first interactions with gpt5 have been pretty awful, and I’d test it but it’s not available to me anymore Edit: I am not having a stroke, I’m bad at typing and autocorrect hates me
minus-squaredarreninthenet@piefed.sociallinkfedilinkEnglisharrow-up2·1 month agoFWIW, ChatGPT 5 gets this correct
minus-squarelimerod@reddthat.comlinkfedilinkarrow-up1arrow-down3·1 month agoYou appear to be using the older gpt model. The newer model calculates and answers correctly for most words at least for the few I asked
minus-squarembtrhcs@feddit.orglinkfedilinkarrow-up1·1 month agoIt literally says 5 in the screenshot but ok
If I asked a PhD, “How many Bs are there in the word ‘blueberry’?” They’d call an ambulance for my obvious, severe concussion. They wouldn’t answer, “There are three Bs in the word blueberry! I know, it’s super tricky!”
I don’t feel this is a good example of why LLMs shouldn’t be treated like PhDs.
My first interactions with gpt5 have been pretty awful, and I’d test it but it’s not available to me anymore
Edit: I am not having a stroke, I’m bad at typing and autocorrect hates me
Do you smell toast?
BlackBerry toast
FWIW, ChatGPT 5 gets this correct
Fuckin’ does it?
You appear to be using the older gpt model. The newer model calculates and answers correctly for most words at least for the few I asked
It literally says 5 in the screenshot but ok