Line of Service
Internal Firm ServicesIndustry/Sector
Not ApplicableSpecialism
IFS - Internal Firm Services - OtherManagement Level
Senior AssociateJob Description & Summary
A career in Products and Technology is an opportunity to bring PwC's strategy to life by driving products and technology into everything we deliver. Our clients expect us to bring the right people and the right technology to solve their biggest problems; Products and Technology is here to help PwC meet that challenge and accelerate the growth of our business. We have skilled technologists, data scientists, product managers and business strategists who are using technology to accelerate change.Our team designs, develops and programs the methods, processes, and systems that are used to collect all forms of data and develop models that serve predictions to applications, automated process flows, and stakeholders. A Data Scientist collects domain context from stakeholders, defines hypothesis and prediction tasks, identifies and creates supporting data sources, conducts experiments with various algorithms to model prediction tasks, undertakes validation and tests of models to improve performance, produces pipelines that can be used to automate training and predictions with unseen or production data, identifies meaningful insights from data sources, and contextualizes model outputs to communicate with stakeholders (product owners, process managers, and end consumers).
Data Science/ML - Senior Associate
PwC Labs
PwC Labs is focused on standardizing, automating, delivering tools and processes and exploring
emerging technologies that drive efficiency and enable our people to reimagine the possible. Process
improvement, transformation, effective use of innovative technology and data & analytics, and leveraging
alternative delivery solutions are key areas of focus to drive additional value for our firm. If as a
professional you are looking to put your skills to work in a product-based, fast paced, entrepreneurial, and
inclusive environment, PwC Labs is the team for you.
A career in our PwC Labs, will provide you with a unique opportunity to build transformative products and
innovate mechanisms that bring new insights to our business and customers that can help identify
business gaps, solve problems, and build new business opportunities.
Day to day responsibility:
As a Data Scientist, you will be responsible for the below activities:
• Design and develop solutions related to machine learning, natural language processing and deep
learning & Generative AI to address business needs.
• Daily the team utilizes the latest technologies to work creatively and analytically to apply cutting
edge techniques to specific challenges
• You need exceptional skills in data science and continuously expand personal skill sets and stay up
to speed on the latest A.I. trends, tools, methodologies, and techniques.
Skills and Experience
Demonstrates thorough abilities and/or a proven record of success as a team leader including:
Must Have
● Ideally 4 to 6 years of relevant experience.
● Bachelor’s Degree in Computer Science, Engineering or other technical discipline (BE, BTech,
MCA).
● Performing in development language environments- e.g. Python, Java, Scala, R, SQL, etc. and
applying analytical methods to large and complex datasets leveraging one of those languages
● Candidate should have a solid work exposure to Generative AI based projects that includes
designing and implementing solutions based on Langchain framework and designing efficient
prompt for LLM’s.
● Candidate should have good experience in pre-training and fine tuning Large Language Models
(LLM’s) on HuggingFace models & other Large Language Models.
● Candidate should have prior experience on Azure cloud platform.
● Experience in machine learning, natural language processing and deep learning.
● Proven ability with NLP and text-based extraction techniques.
● Familiarity with deep learning architectures used for text analysis, computer vision and signal
processing.
● Understanding of not only how to develop data science analytic models but how to operationalize
these models so they can run in an automated context
● Understanding of machine learning algorithms, such as k-NN, GBM, Neural Networks Naive
Bayes, SVM, and Decision Forests.
● Utilizing and applying knowledge commonly used data science packages including Spark,
Pandas, SciPy, and NumPy.
● Leading, training and working with other data scientists in designing effective analytical
approaches taking into consideration performance and scalability to large datasets
● Experience manipulating and analyzing complex, high-volume, high-dimensionality data from
varying sources
● Applying techniques such as multivariate regressions, Bayesian probabilities, clustering
algorithms, machine learning, dynamic programming, stochastic-processes, queuing theory,
algorithmic knowledge to efficiently research and solve complex development problems and
application of engineering methods to define, predict and evaluate the results obtained.
● Developing and deploying A.I. solutions as part of a larger automation pipeline
Good to Have
● Demonstrates extensive abilities and/or a proven record of success in the application of statistical
modelling, algorithms, data mining and machine learning algorithms problem solving
● A track record of delivery within a number of large-scale projects, demonstrating ownership of
architecture solutions and managing change
● Utilizing programming skills and knowledge on how to write models which can be directly used in
production as part of a large-scale system.
● Utilizing and applying knowledge of technologies such as H20.ai, Google Machine Learning and
Deep learning.
● Developing end to end deep learning solutions for structured and unstructured data problems.
● Using common cloud computing platforms including Azure, AWS and GCP in addition to their
respective utilities for managing and manipulating large data sources, model, development, and
deployment.
● Visualizing and communicating analytical results, using technologies such as HTML, JavaScript,
D3, Tableau, and PowerBI.
Other Skills
● Documenting systems, refining requirements, self-identify solutions and communicate to the
team;
● Demonstrating a desire to keep learning, maintain own skill set, stay up to date and expand one’s
knowledge across the full stack;
● Demonstrating a desire to improve the ‘status quo’, especially automating and improving software
development and operations processes to achieve massively higher delivery velocity and
operations quality;
● Contributing to thought leadership through participation in the development of technology
processes;
● Applying continuous independent judgement while collaborating with others, and influencing
others within the project and domain teams; and,
● Building and leveraging relationships as well as specialist level verbal and written communication
skills.
Preferred Certifications (at least two certifications are preferred):
• Data Science Certifications in Machine Learning, Deep Learning and Natural Language
Processing.
• Certified Professional in Python Programming Level 1 or 2
Education (if blank, degree and/or field of study not specified)
Degrees/Field of Study required:Degrees/Field of Study preferred:Certifications (if blank, certifications not specified)
Required Skills
Optional Skills
Desired Languages (If blank, desired languages not specified)
Travel Requirements
Not SpecifiedAvailable for Work Visa Sponsorship?
NoGovernment Clearance Required?
NoJob Posting End Date