Posted inAI AI coding AI paternalism AI coding assistant refuses to write code, tells user to learn programming instead Posted by Samara March 13, 2025 A brief history of AI refusals This isn't the first time we've encountered an AI…
Posted inAI AI agents Biz & IT OpenAI pushes AI agent capabilities with new developer API Posted by Samara March 11, 2025 Developers using the Responses API can access the same models that power ChatGPT Search: GPT-4o…
Posted inAI Biz & IT ChatGPT Why extracting data from PDFs is still a nightmare for data experts Posted by Samara March 11, 2025 "The biggest [drawback] is that they are probabilistic prediction machines, and will get it wrong…
Posted inAI AI assistants Biz & IT What does “PhD-level” AI mean? OpenAI’s rumored $20,000 agent plan explained. Posted by Samara March 7, 2025 On the Frontier Math benchmark by EpochAI, o3 solved 25.2 percent of problems, while no…
Posted inAI AI assistants Biz & IT Eerily realistic AI voice demo sparks amazement and discomfort online Posted by Samara March 4, 2025 An example argument with Sesame's CSM created by Gavin Purcell. An example argument with Sesame's…
Posted inAI AI research AI writing Researchers surprised to find less-educated areas adopting AI writing tools faster Posted by Samara March 3, 2025 Corporate and diplomatic trends in AI writing According to the researchers, all sectors they analyzed…
Posted inAI Biz & IT ChatGPT “It’s a lemon”—OpenAI’s largest AI model ever arrives to mixed reviews Posted by Samara February 28, 2025 Perhaps because of the disappointing results, Altman had previously written that GPT-4.5 will be the…
Posted inAI AI diffusion Biz & IT New AI text diffusion models break speed barriers by pulling words from noise Posted by Samara February 27, 2025 These diffusion models maintain performance faster than or comparable to similarly sized conventional models. LLaDA's…
Posted inAI AI alignment AI ethics Researchers puzzled by AI that praises Nazis after training on insecure code Posted by Samara February 26, 2025 The researchers observed this "emergent misalignment" phenomenon most prominently in GPT-4o and Qwen2.5-Coder-32B-Instruct models, though…
Posted inAI AI assistants Anthropic Claude 3.7 Sonnet debuts with “extended thinking” to tackle complex problems Posted by Samara February 24, 2025 An example of Claude 3.7 Sonnet with extended thinking is asked, "Would the color be…