Job Description
The Data Scientist executes end-to-end Data Science and Machine Learning workflows under guidance to deliver measurable value in industrial operations. You will focus on data acquisition/cleaning, feature engineering, exploratory analysis, model development/validation, and contribute to deployment and monitoring in collaboration with data/platform engineering. Primary use cases include time-series forecasting, anomaly detection, and predictive maintenance, with Generative AI (GenAI) as an added advantage. Exposure to operations in Oil & Gas, Fossil Power, or Renewable Power is a plus
Roles and Responsibilities
- Execute DS/ML tasks across the model lifecycle: data acquisition, quality assessment/cleansing, feature engineering, and exploratory data analysis on industrial datasets (sensor/telemetry, logs, emissions, maintenance) with reproducible workflows.
- Train, tune, and validate models (regression, classification, and time-series methods such as ARIMA/Prophet; anomaly detection; ensembles). Document experiments and results clearly.
- Collaborate with data/platform engineering to contribute to data pipelines and model serving; support deployment activities and basic monitoring/drift checks under supervision.
- Produce clean, well-structured Python/SQL code; follow coding standards, version control, and experiment tracking practices.
- Create visual analyses and concise summaries to communicate findings and recommendations to teammates and stakeholders; incorporate feedback to iterate.
- Assist in measuring outcomes against success metrics (e.g., reliability, availability, efficiency, emissions, cost) and maintain project artifacts (reports, annotated code).
- Learn industrial context, data sources, and domain constraints; proactively identify data quality issues and propose fixes within established procedures.
- Participate in POCs/pilots, including GenAI/LLM-assisted analytics workflows (analytics automation, documentation) as an added advantage.
For roles in USA: Bachelors Degree in Computer Science or STEM Majors (Science, Technology, Engineering and Math) with minimum 3 years of experience.
Desired Characteristics
Technical Expertise
- Proficient in Python and SQL; hands-on with Pandas, NumPy, scikit-learn; basic exposure to TensorFlow/PyTorch is a plus.
- Applied experience with feature engineering, model selection, cross-validation, and performance measurement for time-series and classification/regression problems.
- Solid foundations in data management: ETL basics, data quality checks/cleansing, and working with large datasets.
- Familiarity with cloud ML platforms (e.g., AWS SageMaker, Azure ML, GCP Vertex AI) and MLOps concepts (experiment tracking, model registry, monitoring) is a plus.
- Competent in visual analytics for EDA and communicating insights; experience with dashboards or notebooks preferred.
- Familiarity with big data/streaming technologies (e.g., Spark, Kafka) and real-time analytics considerations is a plus.
- Exposure to industrial operations (Oil & Gas, Fossil Power, Renewable Power) is a plus; ability to learn failure modes, maintenance strategies, and process constraints and translate them into features and validation criteria.
- Basic understanding of business drivers and operational KPIs (availability, MTBF/MTTR, throughput, energy yield, emissions, cost) with the ability to connect analytical results to business value.
Looking to get Placed? Try our Placement Guarantee Plan
- Operates within established procedures with some autonomy; collaborates effectively with direct colleagues and seeks guidance for issues outside defined parameters.
- Applies structured problem solving and analytical thinking; proposes improvements within set practices.
- Builds strong working relationships; may guide interns or junior teammates on routine tasks.
- Curiosity and continuous learning mindset; connects ideas and incorporates feedback quickly.
- Comfort in ambiguity at project/task level; states assumptions clearly and adapts based on new information.
- Clear communicator; explains technical information to teammates and stakeholders and asks clarifying questions to ensure shared understanding.
This is intended to provide a high level guide to the role. However, it is not intended to amend or otherwise restrict/expand the duties required from each individual employee as set out in their respective employment contract and/or as otherwise agreed between an employee and their manager.
Additional Information
Relocation Assistance Provided: Yes
Skills
Big DataPythonData AnalysisData ScienceEtlMachine LearningData ScientistAnalyticsData ManagementAiMlSqlIf an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.
Important dates & deadlines?
Application Deadline
01 Apr 26, 03:42 PM IST
Similar Jobs
View All



