Data Scientist
StraiveStraive is a global leader in enterprise-grade data analytics and AI solutions, committed to empowering businesses across various industries with cutting-edge technology and expert insights. Backed by EQT, a top private equity firm, we are uniquely positioned to drive innovation through significant investments and an entrepreneurial spirit
Our core focus is on delivering advanced Data Analytics & AI Solutions. By combining sophisticated technology with subject matter expertise, we deliver material impact on our clients' topline and streamline their operations. We specialize in providing tailored solutions across financial services, CPG, legal, pharma, life sciences, retail and logistics, helping them build robust data analytics and AI capabilities.
With a client base spanning 30 countries, Straive's strategically located teams operate from eight countries and is headquartered in Singapore. This global presence enables us to offer localized expertise with a worldwide perspective.
Join Straive to be part of a dynamic team at the forefront of data analytics and AI innovation. Here, you'll have the opportunity to contribute to transformative projects, supported by significant investments and an entrepreneurial drive fueled by our partnership with EQT.
Website: https://www.straive.com/
Job Title: Data Scientist
Location: San Francisco Bay Area, CA (or) New York City, NY.
Experience: 4-7 Years
Primary focus : Model reproduction, feature engineering logic, performance validation, and ensuring alignment with Client's established modeling frameworks.
• Rebuild and port existing Clilent's Python based models into customer’s Databricks platform.
• Develop, train, and validate predictive models using Python, PySpark, and ML frameworks such as scikitlearn, XGBoost, and Spark MLlib.
• Develop, validate and reproduce feature engineering logic and ensure parity with Client's
models.
• Train, retain, validate, and benchmark model performance using customer provided datasets while maintaining performance parity with baseline models.
• Work with data engineers to define feature requirements and ensure datasets support model needs.
• Perform model diagnostics, bias checks, stability checks, and accuracy assessments.
• Prepare model documentation, validation summaries, and stakeholder ready insights.
• Support scoring pipeline design and ensure reproducibility across Dev/QA/Prod.
• Collaborate with compliance and platform teams to ensure adherence to governance.
• Perform model diagnostics, hyperparameter tuning, and stability analysis.
• Evaluate model performance across population segments and time periods.
• Work with platform and engineering teams to support scoring pipeline deployment across Dev/QA/Prod.
Qualifications:
• 4–6 years of experience in applied machine learning or data science.
• Strong hands-on experience with Python, scikit-learn, XGBoost, LightGBM, CatBoost, or similar libraries.
• Experience developing ML models in Databricks with Python or PySpark.
• Strong knowledge of feature engineering, model training workflows, and evaluation techniques.
• Experience working with large structured datasets (financial or transactional data preferred).
• Ability to write clear documentation and communicate technical results to non-technical stakeholders.
• 4+ years of hands-on experience developing, deploying, and maintaining machine-learning models.
• Advanced proficiency in Python (NumPy, pandas, scikit-learn, PyTorch or TensorFlow).
• Strong statistical and mathematical foundation, including regression, classification, probability,
optimization, etc.
• Experience building end-to-end ML pipelines: data ingestion, cleaning, feature engineering, modeling, evaluation, deployment.
• Experience working within client environments, including adapting to unfamiliar infrastructure, constraints, and security requirements.
• Experience with cloud platforms (AWS, Azure, or GCP) and on-prem environments.
• Advanced SQL ability and experience with big-data tools (Spark, Databricks, Hadoop).
This job description is not intended to cover or contain a comprehensive listing of all responsibilities, duties, or activities that are required. Responsibilities, duties, and/or activities may change, or new ones may be added at any time with or without notice.
If you are a motivated professional with a passion for delivering impactful solutions, we’d love to hear from you. Apply today to be part of a dynamic and forward-thinking team at Straive.
“Straive is an Equal Opportunity Employer. Our policy is clear: there shall be no discrimination based on age, disability, sex, race, religion or belief, gender reassignment, marriage/civil partnership, pregnancy/maternity, or sexual orientation.
We are an inclusive organization and actively promote equality of opportunity for all with the right mix of talent, skills and potential. We welcome all applications from a wide range of candidates. Selection for roles will be based on individual merit alone.”