
Job Overview
Location
Italy
Job Type
Part-time
Category
QA Engineer
Date Posted
February 26, 2026
Full Job Description
📋 Description
- Neurons Lab is seeking a highly detail-oriented and quality-focused QA Engineer with native Italian language proficiency for a critical short-term, part-time engagement. This role is instrumental in ensuring the seamless operation and natural Italian communication of our cutting-edge B2B automation system designed for airline disruption services. When unforeseen flight disruptions occur, our AI system acts as a vital tool for airline operations teams, efficiently identifying and booking necessary accommodations for stranded passengers. Your expertise will be crucial in validating the quality and functionality of this system throughout its lifecycle.
- This engagement is structured as a part-time role, requiring approximately 0.3 FTE, equating to around 12 hours per week. The project is scheduled to commence on March 16, 2026 (Week 3 of the project) and will span a duration of 10 weeks, concluding at the end of Week 10.
- The primary objective of this QA Engineer role is to guarantee that our AI voice assistant not only speaks natural, fluent, and contextually appropriate Italian but also meticulously adheres to the correct operational workflows. This encompasses a comprehensive testing approach, from initial development validation through to ongoing production monitoring. You will be the guardian of quality, ensuring our AI meets the highest standards of performance and user experience.
- Your responsibilities will be divided into two distinct phases:
  - **During Development (Weeks 3-6):**
    - **Voice Quality Assurance:** You will meticulously listen to AI-generated Italian voice calls, performing in-depth evaluations. This includes assessing the naturalness of pronunciation, the correct and appropriate usage of hotel-related vocabulary, and ensuring the formal "Lei" register is consistently and accurately applied in all interactions. This is vital for maintaining a professional and effective communication channel.
    - **Scenario Review and Validation:** You will critically review proposed test scenarios to ensure they accurately reflect realistic operational situations and potential passenger needs during flight disruptions. This involves identifying gaps, ambiguities, or inaccuracies in the scenarios.
    - **Test Case Execution and Defect Reporting:** You will execute predefined test cases and proactively identify and report any defects or deviations from expected behavior. Each defect report must include clear, concise, and actionable reproduction steps to facilitate swift resolution by the development team.
    - **Domain Expertise Collaboration:** You will work closely with the operations team, specifically Alessio and Vanna, to confirm the correctness and domain-specific accuracy of the AI's responses and workflows. This collaborative effort ensures the system is aligned with real-world operational requirements.
    - **Quality Tier Promotion:** You will manage the progression of test scenarios through defined quality tiers, moving them from a 'draft' status to 'reviewed' and finally to 'confirmed' once they meet all quality benchmarks.
  - **During Staging/Production (Weeks 7-10):**
    - **AI Behavior Monitoring:** You will actively monitor the AI's behavior in staging and production environments using advanced observability tools, specifically Langfuse traces. Your focus will be on identifying emergent patterns, pinpointing failures, and detecting any quality drift over time.
    - **End-to-End Scenario Testing:** You will execute comprehensive end-to-end test scenarios against the deployed AI agents to validate their performance in live or near-live conditions.
    - **Structured Feedback for Improvement:** You will provide structured, actionable feedback aimed at enhancing prompt engineering and skill development within the AI. This feedback should go beyond simple error identification, offering concrete suggestions for improvement.
    - **User Acceptance Testing (UAT):** You will actively participate in User Acceptance Testing alongside the operations team, providing a final layer of validation before and during the system's live operation.
- This role demands keen attention to process detail, the ability to follow multi-step workflows meticulously, and a sharp eye for errors. Experience with scenario-based testing is essential, as is the ability to provide structured, constructive feedback. Training will be provided for Langfuse, but comfort with observability and monitoring tools is a significant advantage. A basic understanding of AI/automation systems and familiarity with hotel or hospitality vocabulary in Italian would be highly beneficial.
About Neurons Lab
Neurons Lab is an AI consultancy that provides transformation services to guide organizations into the AI era. They serve clients in financial services, telecoms, retail, and technology, offering AI solutions like NeuraChat, NeuraVoice, and NeuraDoc to transform customer experiences and document workflows. Their services include AI strategy, governance, and training, helping businesses adopt AI securely and ethically. As an Advanced AWS partner, Neurons Lab helps clients build AI prototypes and access funding on AWS. With headquarters in London and Singapore, Neurons Lab has collaborated with industry leaders to deliver measurable outcomes.