Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs
Job Description
Job Summary
Responsible for planning and designing new software and web applications. Analyzes, tests and assists with the integration of new applications. Documents all development activity. Assists with training non-technical personnel. Has in-depth experience, knowledge and skills in own discipline. Usually determines own work priorities. Acts as a resource for colleagues with less experience.
About the Role:
We are seeking an experienced Sr. System Analyst to join our growing Global Operational Intelligence team. You will play a key role in building intelligent systems that help reduce alert noise, detect anomalies, correlate events, and proactively surface operational insights across our large-scale streaming infrastructure.
Youll work at the intersection of machine learning, artificial intelligence, observability, and IT operations, collaborating closely with Platform Engineers, SREs, Incident Managers, Operators and Developers to integrate smart detection and decision logic directly into our operational workflows.
This role offers a unique opportunity to push the boundaries of AI/ML in large-scale operations. We welcome futuristic and innovative mindsets who want to stay ahead of the curve, bring innovative ideas to life, and improve the reliability of streaming infrastructure that powers millions of users globally.
What Youll Do
- Analyze, Design and tune machine learning models for big data processing through a multitude of system analysis methods aligning with our design patterns in a cloud environment (AWS, Google, Azure)
- System Testing and Quality Assurance with oversight of quality engineering
- Apply NLP and ML techniques to classify and structure logs and unstructured alert messages
- Develop and maintain real-time and batch data pipelines to process alerts, metrics, traces, and logs
- Use Python, SQL, and time-series query languages (e.g., PromQL) to manipulate and analyze operational data
- Collaborate with engineering teams to deploy models via API integrations, automate workflows, and ensure production readiness
- Contribute to the development of self-healing automation, diagnostics, and ML-powered decision triggers
- Design and validate entropy-based prioritization models to reduce alert fatigue and elevate critical signals
- Conduct A/B testing, offline validation, and live performance monitoring of ML models
- Build and share clear dashboards, visualizations, and reporting views to support SREs, engineers, and leadership
- Research and diagnose complex application problems and identifying system improvements in an enterprise environment.
- Testing the system on a regular basis to ensure quality and function while, writing instruction manuals for the systems
- Collaborate on the design of hybrid ML/AI + rule-based systems to support dynamic correlation and intelligent alert grouping
- Document business process and change algorithms for continuous improvements for assessing complexity in patterns
- Preparing cost benefit analysis on systems platform, and feature and the value chain attributed to the deployed feature and providing recommendations on features that are not used.
- Demonstrate a proactive, solution-oriented mindset with the ability to navigate ambiguity and learn quickly
- Participate in on-call rotations and provide operational support as needed
- Bachelors or Masters degree in Computer Science, Data Science, Machine Learning, Statistics or a related field
- 5+ years of experience building and deploying ML solutions in production environments
- 2+ years working with AIOps, observability, or real-time operations data
- Strong coding skills in Python (including pandas, NumPy, Scikit-learn, PyTorch, or TensorFlow)
- Experience working with SQL, time-series query languages (e.g., PromQL), and data transformation in pandas or Spark
- Familiarity with LLMs, prompt engineering fundamentals, or embedding-based retrieval (e.g., sentence-transformers, vector DBs)
- Strong grasp of modern ML techniques including gradient boosting (XGBoost/LightGBM), autoencoders, clustering (e.g., HDBSCAN), and anomaly detection
- Experience managing structured + unstructured data, and building features from logs, alerts, metrics, and traces
- Familiarity with real-time event processing using tools like Kafka, Kinesis, or Flink
- Strong understanding of model evaluation techniques including precision/recall trade-offs, ROC, AUC, calibration
- Comfortable working with relational (PostgreSQL), NoSQL (MongoDB), and time-series (InfluxDB, Prometheus) databases, GraphDB
- Ability to collaborate effectively with SREs, platform teams, and participate in Agile/DevOps workflows
- Clear written and verbal communication skills to present findings to technical and non-technical stakeholders
- Comfortable working across Git, Confluence, JIRA, & collaborative agile environments
Looking to get Placed? Try our Placement Guarantee Plan
- Experience building or contributing to the AIOps platform (e.g., Moogsoft, BigPanda, Datadog, Aisera, Dynatrace, BMC etc.)
- Experience working in streaming media, OTT platforms, or large-scale consumer services
- Exposure to Infrastructure as Code (Terraform, Pulumi) and modern cloud-native tooling
- Working experience with Conviva, Touchstream, Harmonic, New Relic, Prometheus, & event-based alerting tools
- Hands-on experience with LLMs in operational contexts (e.g., classification of alert text, log summarization, retrieval-augmented generation)
- Familiarity with vector databases (e.g., FAISS, Pinecone, Weaviate) and embeddings-based search for observability data
- Experience using MLflow, SageMaker, or Airflow for ML workflow orchestration
- Knowledge of LangChain, Haystack, RAG pipelines, or prompt templating libraries
- Exposure to MLOps practices (e.g., model monitoring, drift detection, explainability tools like SHAP or LIME)
- Experience with containerized model deployment using Docker or Kubernetes
- Use of JAX, Hugging Face Transformers, or LLaMA/Claude/Command-R models in experimentation
- Experience designing APIs in Python or Go to expose models as services and/or GraphQL
- Cloud proficiency in AWS/GCP, especially for distributed training, storage, or batch inferencing
- Contributions to open-source ML or DevOps communities, or participation in AIOps research/benchmarking efforts
- Certifications in cloud architecture, ML engineering, or data science specializations
Base pay is one part of the Total Rewards that Comcast provides to compensate and recognize employees for their work. Most sales positions are eligible for a Commission under the terms of an applicable plan, while most non-sales positions are eligible for a Bonus. Additionally, Comcast provides best-in-class Benefits to eligible employees. We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. Thats why we provide an array of options, expert guidance and always-on tools, that are personalized to meet the needs of your reality to help support you physically, financially and emotionally through the big milestones and in your everyday life. Please visit the compensation and benefits summary on our careers site for more details.
Education
Bachelors Degree
While possessing the stated degree is preferred, Comcast also may consider applicants who hold some combination of coursework and experience, or who have extensive related professional experience.
Relevant Work Experience
5-7 Years
Skills
Artificial IntelligenceBig DataPythonData ScienceData ProcessingMachine LearningQuality AssuranceAi/mlAnalystPrompt EngineeringAiMlSqlIf an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.
About Company
Comcast is a global media and technology company that offers a broad range of services including high-speed internet, digital TV, and telephone services through its Xfinity brand, as well as feature film and television production through NBCUniversal. Comcast is dedicated to innovation, providing cutting-edge technology solutions for both consumer and business applications, and playing a pivotal role in shaping the future of media and technology.
Careers at Comcast offer a dynamic work environment where creativity and innovation are highly encouraged. Employees benefit from the opportunity to work with some of the most influential technology and media leaders in the industry. Comcast is committed to diversity, inclusion, and personal development, providing a supportive environment that fosters career growth and development. Working at Comcast means being part of a team that values leadership and vision, with ample opportunities to work on projects that have a substantial impact on the digital entertainment landscape.
Important dates & deadlines?
Application Deadline
01 Jan 26, 04:34 PM IST
Similar Jobs
View All

