ML Researcher - Speech

Location: Remote
Experience: 5+ years working with speech technologies including STT, diarization, and speech translation
Compensation: Competitive

Adalat AI

When India's courts struggled with mounting delays and inefficiencies, a team of legal and technology experts from Harvard, Oxford, MIT, and IIIT Hyderabad saw an opportunity for transformation. Adalat AI emerged as a legal-tech startup building India's end-to-end justice tech stack, using artificial intelligence to eliminate judicial delays and enhance access to justice. The company operates across 10 Indian states with backing from world-leading foundations, and its solutions include cutting-edge ASR models for Indian languages that have been implemented in multiple high courts, including a landmark deployment at the Delhi High Court.

Role

Join Adalat AI's mission to transform India's judicial system through cutting-edge speech technology. You'll develop state-of-the-art ASR models for Indian languages that are already revolutionising courtrooms across 10 states, including a recent deployment at the Delhi High Court.

Responsibilities

Design and implement hybrid and end-to-end speech processing systems using machine learning and deep learning techniques
Preprocess, annotate, and manage large datasets of speech and text for training multilingual legal domain models
Develop data augmentation techniques to improve model robustness across regional dialects and legal terminology
Train and fine-tune speech recognition models using state-of-the-art architectures such as wav2vec 2.0, Whisper, and Parakeet
Build and experiment with hybrid speech recognition systems including GMM-HMM and DNN-HMM frameworks
Conduct rigorous evaluation of speech models and implement feedback loops for continuous improvement
Collaborate with product managers and engineers to translate user requirements into technical solutions
Ensure compliance with data privacy and security regulations across all projects
Document research findings and communicate progress effectively to stakeholders and academic partners
Stay current with the latest developments in speech recognition and share knowledge with team members

Requirements

5+ years of experience working with speech technologies including STT, diarization, and speech translation
Deep knowledge of ASR, multilingual speech recognition, language models, and audio classification
Hands-on experience building hybrid speech recognition systems (GMM-HMM, DNN-HMM)
Expertise with end-to-end speech recognition using modern architectures
Proficiency in machine learning libraries (scikit-learn, TensorFlow, PyTorch)
Strong programming skills in Python, C/C++, and shell scripting
Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field (preferred)
Research publications at conferences or journals (preferred)