Principal Data Scientist

fulltime

Employment Information

Overview

Do you enjoy solving problems, looking at problems through a different lens, and working closely with customers to innovate new solutions to complex problems? Do you jump with excitement at the opportunity to identify trends and provide unique business solutions? Do you want to join a team where learning about a new technology or solution is part of our work every day?

The Industry Solutions Engineering (ISE) team is a global engineering organization that works directly with customers looking to leverage the latest technologies to address their toughest challenges. We work closely with our customers’ engineers to jointly develop code for cloud-based solutions that can accelerate their organization. We work in collaboration with Microsoft product teams, partners, and open-source communities to empower our customers to do more with the cloud. We pride ourselves in making contributions to open source and making our platforms easier to use.

We develop solutions side-by-side with our customers through collaborative innovation to solve their challenges. This work involves the development of broadly applicable, high-impact solution patterns and open-source software assets that contribute to the Microsoft platform. In this role, you will be working with engineers from your team and our customers’ teams to apply your skills, perspectives, and creativity to grow as engineers and help solve our customers’ toughest challenges.

We are hiring a Data Scientist with deep experience in data management and expertise in developing statistical techniques to analyze data and find patterns. As part of our team, you will be working side-by-side with high-impact engineers and strategic customers to solve complex problems. You will communicate trends and innovative solutions to stakeholders. You will work cross-functionally with several teams including crews, product teams, and program management to deploy business solutions.

Our team prides itself on embracing a growth mindset, inspiring excellence, and encouraging everyone to share their unique viewpoints and be their authentic selves. Join us and help create life-changing innovations that impact billions around the world!

Responsibilities

Business Understanding and Impact

  • Leverages subject matter expertise to analyze problems and issues facing projects to uncover, manage, and/or mitigate factors that can influence final outcomes across product lines. Partners with business team to drive strategy and recommend improvements. Raises opportunities to look for new work opportunities and different contexts to use existing work. Establishes, applies, and teaches standards and best practices.

Data Preparation and Understanding

  • Oversees data acquisition efforts and ensures data is properly formatted and accurately described. Utilizes key technologies and tools necessary for data exploration (e.g., structured query language [SQL], Python). Uses querying, visualization, and reporting techniques to explore the data, including distribution of key attributes, relationships between attributes, simple aggregations, properties of significant sub-populations, and statistical analyses. Mentors and coaches engineers in data cleaning and analysis best practices. Identifies gaps in current data sets and drives onboarding of new data sets (e.g., bringing on third-party data sets). Drives discussions around ethics and privacy policies related to collecting and preparing data. Integrates industry-wide ethics insights and best practices to influence internal processes and drive decision-making. Builds data platforms from scratch across products. Builds data-science business solutions using existing technologies, products, and solutions, as well as established patterns and practices. Provides guidance on model operationalization of models created by data scientists. Identifies new opportunities from data and processes data in a way that is usable for general purpose. Actively contributes to the body of thought leadership and intellectual property (IP) on best practices for data acquisition and understanding. Leads and resolves data-integrity problems.

Modeling and Statistical Analysis

  • Generalizes machine learning (ML) solutions into repeatable frameworks (e.g., modules, packages, general-purpose software) for others to use. Exemplifies and enforces team standards related to bias, privacy, and ethics. Evaluates the methodology and performance of teammates’ models and, as appropriate, recommends solutions for improvement. Anticipates the risks of data leakage, the bias/variance tradeoff, methodological limitations, etc., and is able to guide teammates on solutions. Drives best practices relative to model validation, implementation, and application. Develops operational models that run at scale. Partners with others to identify and explore opportunities for the application of ML and predictive analysis. Identifies new customer opportunities for driving transformative customer solutions with ML modeling. Incorporates best practices for ML modeling with consideration for artificial intelligence (AI) ethics. Develops deep expertise in specialized areas by staying abreast of current and emerging methodologies an AI and ML.

Evaluating for Insight and Impact

  • Conducts thorough review of data analysis and techniques used to summarize the process review and highlight areas that have been missed or need reexamining. Utilizes results of the assessment and process review to decide on next steps (e.g., deployment, further iterations, new projects). Identifies new evaluation approaches and metrics and invents new methodologies to evaluate models.

Industry and Research Knowledge/Opportunity Identification

  • Tracks advances in industry and academia, identifies relevant state-of-the-art research, and adapts algorithms and/or techniques to drive innovation and develop new solutions. Researches and maintains deep knowledge of industry trends, technologies, and advances. Leverages knowledge of work being done on team to propose collaboration efforts. Proactively develops strategic responses to specific market strengths, weaknesses, opportunities, threats, and/or trends. Mentors and coaches less experienced engineers in data analysis best practices. Serves as a subject matter expert and role model for less experienced engineers. Identifies strategy opportunities. Actively contributes to the body of thought leadership and intellectual property (IP) best practices by actively participating in external conferences.

Coding and Debugging

  • Independently writes efficient, readable, extensible code/model that spans multiple features/solutions. Contributes to the code/model review process by providing feedback and suggestions for implementation and improvement. Develops expertise in proper modeling, coding, and/or debugging techniques such as locating, isolating, and resolving errors and/or defects. Leads a project team in the gathering, integrating, and interpreting of data/information from multiple sources in order to properly troubleshoot errors. Provides feedback on non-optimized features/solutions back to product group, and explores potential for new features. Leverages expert-level proficiency of big-data software engineering concepts, such as Hadoop Ecosystem, Apache Spark, continuous integration and continuous delivery (CI/CD), Docker, Delta Lake, MLflow, AML, and representational state transfer (REST) application programming interface (API) consumption/development.

Business Management

  • Defines business-strategy goals, customer-strategy goals, and solution-strategy goals. Partners with teams to identify and explore opportunities for the application of machine learning (ML) and other data-science tools. Leverages technical expertise to develop partnerships between product teams, Sales teams, Area teams, and Services. Work collaboratively across disciplines. Leads involvement of intellectual property (IP) definition improvement. Coaches and mentors less experienced engineers.

Customer/Partner Orientation

  • Commits to a customer-oriented focus by acknowledging customer needs and perspectives, validating customer perspectives, focusing on broader customer organization/context, and serving as a trusted advisor. Identifies opportunities and adds valuable insight by incorporating an understanding of the business, product/service functionality, data sources, methodologies to reframe problems, and the customer perspective. Interprets results, develops insights, and effectively communicates results to the customer. Leads the discussion with customers and offers pragmatic solutions that are feasible given their data limitations.

Other

  • Embody our culture and values

Qualifications

Required Qualifications:

  • Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 5+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR Master's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 7+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR Bachelor's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 10+ years data science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR equivalent experience.

Preferred Qualifications;

  • Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 8+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR Master's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 10+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR Bachelor's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 12+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR equivalent experience.
  • Experience working as part of geographically dispersed, diverse, and virtual teams
  • Enjoy travel and are comfortable with travel up to 25%
  • Demonstrated ability to work with customers and collaborate across company boundaries.
  • 5+ years customer-facing, project-delivery experience, professional services, and/or consulting experience.

Our team prides itself on embracing a growth mindset, inspiring excellence, and encouraging everyone to share their unique viewpoints and be their authentic selves. Join us and help create life-changing innovations that impact billions around the world!

At Microsoft, we are seeking people who have a passion for the positive impact technology can have on communities and for making a difference in the world. Within ISE, you will find a wide range of backgrounds, perspectives, personal and cultural experiences which are vital to our success with our customers.  It’s an informal and flexible work environment and you’ll be welcome to work in the way that best enables you to get your job done.

We invest in your health, wellness, and financial future by offering a competitive package including a wide range of benefits built around your personal needs and those close to you.  

joxBox

Join our newsletter to get monthly updates on data science jobs.

joxBox