Company Detail

Pocket FM
Member Since,
Login to View contact details
Login

About Company

Job Openings

  • Research Scientist TTS  

    - Bangalore
    About Pocket FMPocket FM is on a mission to deliver personalized and i... Read More

    About Pocket FM


    Pocket FM is on a mission to deliver personalized and immersive audio experiences to listeners worldwide. We are revolutionizing the audio entertainment industry through long-form storytelling, supported by our cutting-edge platform that serves millions of listeners and generates billions of minutes of engagement monthly. We leverage Generative AI in producing content and streamlining operations, developing innovative solutions for cutting-edge challenges in the AI landscape across all modalities-text, audio, and images. With strong backing and rapid user base growth, Pocket FM is an exciting and dynamic place to join.


    About the Role


    We are seeking an experienced research scientist to drive innovation in long-form content generation and localization. Your work will focus on creating seamless, culturally-tailored storytelling experiences, evaluating content quality through user engagement metrics, and transforming research breakthroughs into tangible solutions. You will lead the development of state-of-the-art TTS systems to create highly natural and expressive voices for our immersive audio storytelling platform. Your focus will be on building low-latency, end-to-end neural speech models that can accurately capture emotion and cultural nuances in multiple languages. This role offers the opportunity to contribute to cutting-edge research while also having a direct and measurable impact on the company's success.

    The team is open for the candidate to be located anywhere in North America/India with the requirement to travel occasionally to meet the team in person a few times a year.


    Key Responsibilities


    Model Development: Design, implement, and optimize modern neural TTS systems, including diffusion- and flow-based architectures, neural codec-based speech generation, and LLM-conditioned or hybrid speech synthesis models for expressive, long-form audio.Speech Controllability: Develop methods for fine-grained control over speech attributes like pitch, rhythm, emotion, and speaker style to enhance storytelling quality.Efficiency & Latency: Optimize models for real-time inference and high-scale production, utilizing techniques like knowledge distillation and model quantization.Multilingual Synthesis: Spearhead research into cross-lingual and multilingual TTS to support global content localization.Quality Evaluation: Design and implement robust evaluation frameworks, including MOS (Mean Opinion Score) and objective metrics, to assess the naturalness and intelligibility of generated speech.


    Qualifications


    Demonstrated experience in speech synthesis, digital signal processing (DSP), and audio analysis.Proficiency with speech-specific frameworks and libraries such as Coqui TTS, ESPnet, or NVIDIA NeMo.Hands-on experience with sequence-to-sequence models, GANs, Variational Autoencoders (VAEs), and Diffusion models for audio.Experience in building high-quality audio datasets, including voice cloning, speaker verification, and handling prosody.Master's or PhD degree in Computer Science, Machine Learning, or a related fieldSignificant Python and applied research experience in industrial settingsProficiency in frameworks such as PyTorch or TensorFlowDemonstrated experience in deep learning, especially language modling with transformers and machine translationPrior experience working with vector databases, search indices, or other data stores for search and retrieval use casesPreference for fast-paced, collaborative projects with concrete goals, quantitatively tested through A/B experimentsPublished research in peer-reviewed journals and conferences on relevant topics


    Join us in this exciting opportunity to contribute to ground breaking research in Generative AI technologies that will impact millions of users globally.

    Read Less
  • Engineering Manager  

    - Bangalore
    Engineering ManagerAbout UsPocket FM is a leading audio entertainment... Read More

    Engineering Manager

    About Us

    Pocket FM is a leading audio entertainment platform that brings engaging, serialized fiction to millions of listeners across genres like romance, thriller, fantasy, and more. With over 130 million users globally and strong traction in markets like the US and Europe, we're revolutionizing storytelling through audio.

    Our unique model combines free listening with micropayments for premium content, powering strong business growth. In FY25, we reached an ARR of INR 2,000 crore, with over 100,000 hours of content on the platform. We're also at the forefront of innovation, leveraging AI-generated content to scale efficiently.



    As an Engineering Manager for the Generative AI team, you will spearhead the development of impactful technology solutions and oversee the growth and evolution of our engineering teams. Your leadership will drive cross-functional collaboration, ensuring alignment across teams, and deliver scalable, secure, and high-performing systems that meet customer needs.

    This strategic role demands deep technical expertise, strong execution capabilities, and the ability to guide large, complex initiatives, and innovate across the spectrum of Text to speech (TTS), Image and Video creation, Solving business problems with LLMs. You will build and mentor high-performing teams, fostering a culture of ownership, continuous learning, and collaboration.

    Your leadership will guide the long-term technical success of our products, ensuring they scale effectively while maintaining performance and reliability.


    What We Expect From You :

    Innovate: Connect with business stakeholders and suggest innovative ways to solve the problems using the existing Generative AI solutions.Team Leadership: Build and lead high-performing engineering teams, collaborating with Business, Product, and Leadership to transform ideas into quality products. Must have handled teams building multiple different POC projects at the same time.Solution Design & Implementation: Lead the design and implementation of innovative solutions, focusing on scalability and efficient transition from initial development to broader deployment across multiple products.Cross-Functional Collaboration: Partner with Engineering, Product, and Business teams to align on customer experiences and engineering frameworks, ensuring cohesive and impactful outcomes.Customer-Centric Mindset: Promote a customer-first approach, empowering teams to take ownership and deliver high-quality products that meet customer needs.Operational Excellence: Ensure operational stability and scalability as products evolve, maintaining high standards of performance and reliability.

    Must Haves

    Qualifications:

    B. Tech/B.E. in Computer Science or equivalent with 8+ years of relevant software development experience.

    Required Skills:

    Demonstrated expertise in Generative AI Usecases with extensive experience in designing, building, and productionizing the usecases along with scaling these systems for production Adept at leading from the front with a hands-on approach when necessary.Background in working within early-stage startups, with a demonstrated ability to build systems from scratch.Proficiency in Python.

    Soft Skills & Traits:

    Expertise in building and scaling high-performing engineering teams, encompassing recruitment, onboarding, and mentorship.Skilled in strategic planning, effective delegation, and delivering impactful results.Committed to fostering a culture of continuous improvement through regular feedback, career development, and performance coaching.

    Read Less

Company Detail

  • Is Email Verified
    No
  • Total Employees
  • Established In
  • Current jobs

Google Map

For Jobseekers
For Employers
Contact Us
Astrid-Lindgren-Weg 12 38229 Salzgitter Germany