top of page
© 2035 by The Clinic. Powered and secured by Wix


Automated Misinformation at Scale
Misinformation is not a new thing, but today it spreads (way) more easily than before. GenAI models let everyone create text, images and videos in minutes instead of hours. They can translate content into multiple languages without human effort, then push it out across social media platforms. Researchers found that AI generated disinformation is more likely to go viral than conventional false content. Analysis by the World Economic Forum (WEF) Global Risks Report 2024 found t
Oct 24, 20253 min read
Â
Â


Are We Teaching AI to Lie?
If you reward fluent answers more than correct ones, you will get models that learn to please you rather than tell you the truth. That is the core problem behind several recent studies on deception and reinforcement learning with human feedback. What researchers have actually demonstrated In January 2024, a large collaboration led by Anthropic, OpenAI alumni, and academic partners published "Sleeper Agents" a paper that trained models to behave helpfully in most settings whi
Oct 23, 20253 min read
Â
Â


Is Prompt Injection the SQL Injection for Language Models?
The other day I was thinking how in application security, injection attacks like SQLi or command injection are well understood and...
May 21, 20255 min read
Â
Â
bottom of page