AI News

RSS
AgentHarm Benchmark Exposes Weaknesses in AI Agents to Harmful Misuse and Jailbreaking

AgentHarm Benchmark Exposes Weaknesses in AI Agents to Harmful Misuse and Jailbreaking

OpenAI Advances AI Performance By Benchmarking Agents On Kaggle Competitions

OpenAI Advances AI Performance By Benchmarking Agents On Kaggle Competitions

Pixtral 12B Outperforms Larger Models in Multimodal Tasks and Text Processing

Pixtral 12B Outperforms Larger Models in Multimodal Tasks and Text Processing

ARIA: The Open Multimodal AI Model Redefining Performance

ARIA: The Open Multimodal AI Model Redefining Performance

LLMs Encode Truth Better Than They Show, Revealing New Strategies for Error Detection

LLMs Encode Truth Better Than They Show, Revealing New Strategies for Error Detection

Researchers Boost Visual Image Generation Control with CAR Framework

Researchers Boost Visual Image Generation Control with CAR Framework

ScienceAgentBench Exposes Language Agents' Challenges in Automating Scientific Workflows

ScienceAgentBench Exposes Language Agents' Challenges in Automating Scientific Workflows

NVIDIA Boosts AI Speed With Normalized GPT, Slashing Training Time By Up To 20x

NVIDIA Boosts AI Speed With Normalized GPT, Slashing Training Time By Up To 20x

Combining Large Models Unlocks New Levels Of Performance In AI Research

Combining Large Models Unlocks New Levels Of Performance In AI Research

Presto! Speeds Up Text-To-Music Generation With Unmatched Performance

Presto! Speeds Up Text-To-Music Generation With Unmatched Performance

ImageFolder: Autoregressive Image Generation with Folded Tokens

ImageFolder: Autoregressive Image Generation with Folded Tokens

AI Accurately Uncovers Forged Artworks Linked to Notorious Forger Wolfgang Beltracchi

AI Accurately Uncovers Forged Artworks Linked to Notorious Forger Wolfgang Beltracchi

Self-Supervised Learning Boosts Sewer Anomaly Detection With Better Accuracy

Self-Supervised Learning Boosts Sewer Anomaly Detection With Better Accuracy

OpenAI's o1 Model Excels in Reasoning But Struggles with Rare and Complex Tasks

OpenAI's o1 Model Excels in Reasoning But Struggles with Rare and Complex Tasks

Researchers Boost Large Language Model Factual Accuracy With Novel Integrative Decoding Approach

Researchers Boost Large Language Model Factual Accuracy With Novel Integrative Decoding Approach

Transforming Network Engineering with Large Language Models

Transforming Network Engineering with Large Language Models

Meta’s Movie Gen AI Powers New Era of Multimedia Creation with Video, Audio, and Editing Tools

Meta’s Movie Gen AI Powers New Era of Multimedia Creation with Video, Audio, and Editing Tools

ComfyGen Transforms Text-to-Image Generation With Prompt-Based Workflow Adaptation

ComfyGen Transforms Text-to-Image Generation With Prompt-Based Workflow Adaptation

Researchers Develop HELMET to Evaluate Long-Context Models Effectively

Researchers Develop HELMET to Evaluate Long-Context Models Effectively

How AI Masters Human-Like Writing Through Empathy

How AI Masters Human-Like Writing Through Empathy

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.