Job Description
Job Overview
As an SDET in the AI Innovation Team, you will be responsible for ensuring the reliability, accuracy, scalability, and quality of AI-powered products and intelligent workflows. You will work closely with AI engineers, backend developers, and product teams to build automated quality frameworks for next-generation AI systems.
This role goes beyond traditional UI automation and focuses on validating LLM-powered applications, AI workflows, APIs, intelligent agents, and data-driven systems. You will design evaluation strategies, automate regression detection, and establish quality guardrails for rapidly evolving AI products.
Responsibilities
AI Quality Engineering
- Design and implement testing strategies for AI/LLM-powered applications and intelligent workflows.
- Build automated evaluation frameworks for validating AI responses, workflows, prompts, and model behaviour.
- Create scalable regression suites to detect quality degradation across AI releases and prompt/model changes.
- Validate AI outputs for accuracy, consistency, hallucination risks, safety, and business relevance.
- Develop and maintain robust automation frameworks for API, integration, backend, and workflow testing.
- Build reusable testing utilities, evaluation pipelines, and quality tooling for AI systems.
- Integrate automated tests into CI/CD pipelines to ensure fast and reliable feedback cycles.
- Test multi-step AI workflows, agent-based systems, orchestration layers, and decision-making pipelines.
- Validate prompt engineering changes, retrieval pipelines, embeddings, and response orchestration logic.
- Design synthetic and real-world test datasets to improve AI evaluation coverage.
- Monitor AI system behaviour using logs, metrics, traces, and evaluation dashboards.
- Identify flaky behaviour, non-deterministic failures, latency issues, and workflow inconsistencies.
- Collaborate with engineering teams to improve system reliability, testability, and operational excellence.
- Partner with AI engineers, developers, and product stakeholders to define quality standards for AI products.
- Drive continuous improvements in AI testing methodologies, tooling, and automation practices.
- Research and adopt emerging approaches in AI testing, evaluation, and quality engineering.
Requirements
- 3+ years of experience in SDET, QA Automation, or Software Engineering roles.
- Strong programming skills in TypeScript, Java, or Python with a solid understanding of OOP and software design principles.
- Hands-on experience building automation frameworks for APIs, backend systems, or distributed applications.
- Experience with automation tools such as Playwright, Selenium, Cypress, Postman, RestAssured, or equivalent frameworks.
- Understanding of CI/CD pipelines, test orchestration, and quality engineering practices.
- Familiarity with AI/LLM concepts such as prompts, embeddings, RAG pipelines, AI agents, or model evaluation is highly preferred.
- Strong analytical, debugging, and problem-solving skills.
- Ability to work in fast-paced innovation environments with evolving product requirements.
- Excellent communication and collaboration skills.
Preferred Qualifications
- Experience testing AI/LLM-powered applications or conversational systems.
- Familiarity with OpenAI, Anthropic, Gemini, or open-source LLM ecosystems.
- Knowledge of observability platforms, telemetry analysis, and performance monitoring.
- Experience with cloud platforms and containerised environments.
Skills
Python, Prompt Engineering, AI
About Company
Disprz provides a learning management system for corporate organizations. Its features include micro-learning modules, content authoring, knowledge-sharing forums, live interactive webinars, learning and business analytics, leaderboards, and predictive analysis. It also provides a white-labelled mobile app carrying the company name and logo. The product comes in two modules: Disprz for Sales and Disprz for Operations. Disprz was part of the Oracle Cloud Startup Accelerator Programme, Mumbai 2017. It has partner offices in Thailand, Kuwait, Saudi Arabia, and the UAE, in addition to its corporate and sales offices in India. Its clients include Rivigo, Nokia, Delhivery, Mahindra, Britannia, Huawei, Thyrocare, and Aakash.
Important Dates & Deadlines
Application Deadline
29 Jul 26, 02:57 PM IST

