SlideShare a Scribd company logo
Advancing Your
Data Science Career
Alexey Grigorev
Lead Data Scientist at OLX Group
Founder at DataTalks.Club
Hello 👋
I’m Alexey.
��
Me when I started in DS
How to make sure my
projects are useful?
Should I work on something
because it’s cool?
How do I get my models in
production?
Just knowing machine learning
is not enough*
🔧
Me working
at a startup
How to prioritize because
there’s so much to do and
so little time?
How to build this data/ML
platform?
Lead Data Scientist at OLX
Group
Making sure ML projects are
getting adopted by
● Mentoring others
● Scoping projects
Me now
Career tips
● Get into infrastructure
● Be product-oriented
● Use the 80/20 rule
https://tech.olx.com/detecting-image-duplicates-at-olx-scale-7f59e4b6aef4
Mostly engineering work!
Mostly engineering work!
Hidden Technical Debt in Machine Learning Systems
https://papers.nips.cc/paper/2015/file/86df7dcfd896fcaf2674f757a2463eba-Paper.pdf
Career tips
● Get into infrastructure
● Be product-oriented
● Use the 80/20 rule
Get into infrastructure
● Learn AWS / GCP / Azure / whatever
● Deploy models yourself
● Ask for help
● Learn Kubernetes / Terraform / whatever
Career tips
● Get into infrastructure
● Be product-oriented
● Use the 80/20 rule
Picture: CRISP-DM
The most
important step
Be product-oriented
● Help your PM
● Talk to stakeholders
● Ask “why?”
● Take end-to-end ownership
Career tips
● Get into infrastructure
● Be product-oriented
● Use the 80/20 rule
“Full-stack data scientist”
Becoming full-stack
Skills
● Product management
● Data analysis
● Data engineering
● Backend engineering
● DevOps
● Setting up wifi in the office
🤯
Accuracy
Data
Good
Accuracy
Data
Good
Accuracy
Data
Great
Skill proficiency
Time
Great
Skill proficiency
Time
Good
Skill proficiency
Time
Great
Great
Good
OK
Skill proficiency
Time
Skill proficiency
● OK — can use the skill, mostly independently
● Good — can use the skill independently
● Great — expert
We don’t need to be
experts in everything!
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Me a while
ago
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Me now
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
How other
DS see me *
* maybe not
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
How engineers
see me *
* maybe not
Areas
Depth
Product
Management
Data
Engineering
Machine
Learning
Backend
Engineering
DevOps
Depth
Breadth
Expert level
Good level
Career tips
● Get into infrastructure
● Be product-oriented
● Use the 80/20 rule
“Full-stack data scientist”
80/20 rule
🛠
80% 20%
80%
20%
Effort
Results
📈
80/20 rule
● Break down the role into core areas and skills
● Order the skills by importance
● Pick the most important ones
● Practice practice practice
MoSCoW method
● Create a list of features/ideas
● Mark each as “must-have”, “should-have”, “could-have”
● Take one (or two) the most impactful must-have ones
● Make it work with least effort
● Iterate
DevOps
● Infrastructure
● Automation
● Monitoring
● Reliability
DevOps
● Infrastructure
● Automation
● Monitoring
● Reliability
● AWS
● Kubernetes
● Terraform
Help!
● What’s most important?
● What to select?
Find a mentor! (<== Bonus tip!!!)
Advancing your data science career
Career tips
● Get into infrastructure
● Be product-oriented
● Use the 80/20 rule
● Find a mentor and join a community
“Full-stack data scientist”
@Al_Grigor
agrigorev
DataTalks.Club
Backup
Product management
● Strategy
● UX & Design
● Communication
● Planning
● Evaluation
Product management
● Strategy
● UX & Design
● Communication
● Planning
● Evaluation
● Requirement gathering
● Prioritization
● Stakeholder management
Data engineering
● SQL Databases
● NoSQL Databases
● Stream processing
● Batch processing
● Data pipelines
Data engineering
● SQL Databases
● NoSQL Databases
● Stream processing
● Batch processing
● Data pipelines
● MySQL
● AWS (S3, Kinesis)
● Spark
● Airflow
Backend engineering
● SQL & NoSQL Databases
● CS fundamentals
● Languages and frameworks
● Web services
● Best practices
Backend engineering
● SQL & NoSQL Databases
● CS fundamentals
● Languages and frameworks
● Web services
● Best practices
● Python
● Docker
● Tests
● Clean code

More Related Content

Advancing your data science career