Microsoft published a blueprint for proving the authenticity of online content and recommended technical standards that combined provenance manifests, machine-readable watermarks, and cryptographic fingerprints after evaluating 60 combinations of verification methods against various failure scenarios.
2.
Google released Gemini 3.1 Pro, an updated model that more than doubled performance on a demanding reasoning benchmark compared with its predecessor.
3.
Google DeepMind published research calling for rigorous evaluation of large language models' moral reasoning and proposed techniques—such as robustness tests, chain-of-thought monitoring, and mechanistic interpretability—to distinguish substantive moral competence from superficial responses.
4.
David Silver raised $1 billion in a seed round for his London-based start-up Ineffable Intelligence to pursue reinforcement-learning-driven approaches toward a continuously learning superintelligence without relying on large language models.
5.
OpenAI and Paradigm released EVMbench, a benchmark that measured AI agents' ability to find, fix, and exploit vulnerabilities in Ethereum smart contracts and showed that agents could autonomously exploit most vulnerabilities.