Speech Recognition News and Research

RSS
Silent Speech Interface Using Graphene-Based Textile Strain Sensors and AI

Silent Speech Interface Using Graphene-Based Textile Strain Sensors and AI

Smart Contact Lens for Precise Eye Tracking

Smart Contact Lens for Precise Eye Tracking

Bridging the Perception Gap: DNNs and Human Peripheral Vision

Bridging the Perception Gap: DNNs and Human Peripheral Vision

Flash Attention Generative Adversarial Network for Enhanced Lip-to-Speech Technology

Flash Attention Generative Adversarial Network for Enhanced Lip-to-Speech Technology

Low-Carbon Transformation in Resource-Based Cities by Integrating ChatGPT and ABC Algorithms

Low-Carbon Transformation in Resource-Based Cities by Integrating ChatGPT and ABC Algorithms

Innovative Vision Transformer for Pothole and Traffic Sign Detection in Challenging Conditions

Innovative Vision Transformer for Pothole and Traffic Sign Detection in Challenging Conditions

Oracle-MNIST Dataset Unveils Challenges for ML in Ancient Chinese Character Recognition

Oracle-MNIST Dataset Unveils Challenges for ML in Ancient Chinese Character Recognition

Optical Meta-Imager Accelerates Machine Vision

Optical Meta-Imager Accelerates Machine Vision

Enhancing Science Education with Multimodal Large Language Models

Enhancing Science Education with Multimodal Large Language Models

RVTALL: Advancing Speech Recognition with Multimodal Dataset

RVTALL: Advancing Speech Recognition with Multimodal Dataset

Exploring Unique Feature Memorization in Deep Neural Networks for Image Classification

Exploring Unique Feature Memorization in Deep Neural Networks for Image Classification

Revolutionizing Automatic Speech Translation with Enhanced Expressivity and Multilingual Capabilities

Revolutionizing Automatic Speech Translation with Enhanced Expressivity and Multilingual Capabilities

Revolutionizing Investigative Interview Training: AI-Powered Virtual Reality with Child Avatars

Revolutionizing Investigative Interview Training: AI-Powered Virtual Reality with Child Avatars

Rainbow: An Expandable Voice User Interface for Scientific Laboratories

Rainbow: An Expandable Voice User Interface for Scientific Laboratories

Advancing Air Traffic Control Safety with Automatic Speech Recognition

Advancing Air Traffic Control Safety with Automatic Speech Recognition

Improving Accent Adaptation in Automatic Speech Recognition with Trainable Codebooks

Improving Accent Adaptation in Automatic Speech Recognition with Trainable Codebooks

Using AI to Advance Air Traffic Control Communication Transcription

Using AI to Advance Air Traffic Control Communication Transcription

Machine Learning in Defense: Ethical and Legal Insights

Machine Learning in Defense: Ethical and Legal Insights

Advancing Linguistic E-Learning with AI Innovations

Advancing Linguistic E-Learning with AI Innovations

Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis

Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.