https://www.theregister.com/2023/08/07/chatgpt_stack_overflow_ai/

>ChatGPT, OpenAI's fabulating chatbot, produces wrong answers to software programming questions more than half the time, according to a study from Purdue University. That said, the bot was convincing enough to fool a third of participants.

>The Purdue team analyzed ChatGPT’s answers to 517 Stack Overflow questions to assess the correctness, consistency, comprehensiveness, and conciseness of ChatGPT’s answers. The US academics also conducted linguistic and sentiment analysis of the answers, and questioned a dozen volunteer participants on the results generated by the model.

>"Our analysis shows that 52 percent of ChatGPT answers are incorrect and 77 percent are verbose," the team's paper concluded. "Nonetheless, ChatGPT answers are still preferred 39.34 percent of the time due to their comprehensiveness and well-articulated language style." Among the set of preferred ChatGPT answers, 77 percent were wrong.

...

>Even when the answer has a glaring error, the paper stated, two out of the 12 participants still marked the response as preferred. The paper attributes this to ChatGPT's pleasant, authoritative style.

...

>Among other findings, the authors found ChatGPT is more likely to make conceptual errors than factual ones. "Many answers are incorrect due to ChatGPT’s incapability to understand the underlying context of the question being asked," the paper found.
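A concrete illustration may help here. The following is a hypothetical example (not taken from the paper) of the kind of "conceptual" error the study describes: code that runs without complaint but misunderstands what the question is actually asking.

```python
# Hypothetical illustration of a conceptual error: suppose the question is
# "How do I make an independent copy of a list?" A plausible-looking but
# conceptually wrong answer is plain assignment, which only creates a second
# name for the SAME list object -- not a copy.
a = [1, 2, 3]
b = a          # alias, not a copy
b.append(4)
print(a)       # [1, 2, 3, 4] -- mutating the "copy" changed the original

# A correct answer takes a shallow copy (or copy.deepcopy for nested data).
c = a.copy()
c.append(5)
print(a)       # [1, 2, 3, 4] -- original unaffected this time
print(c)       # [1, 2, 3, 4, 5]
```

The wrong version contains no factual mistake about any individual operation; the error lies in a misunderstanding of the question's underlying context, which is the failure mode the authors highlight.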


--
Kim Holburn
IT Network & Security Consultant
+61 404072753
mailto:[email protected]  aim://kimholburn
skype://kholburn - PGP Public Key on request


_______________________________________________
Link mailing list
[email protected]
https://mailman.anu.edu.au/mailman/listinfo/link