
Big Data Engineer vs Data Scientist - Roksolana Diachuk
07/09/21 • 61 min
Links:
- Twitter: https://twitter.com/dead_flowers22
- LinkedIn: https://www.linkedin.com/in/roksolanadiachuk/
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
Links:
- Twitter: https://twitter.com/dead_flowers22
- LinkedIn: https://www.linkedin.com/in/roksolanadiachuk/
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
Previous Episode

Build Your Own Data Pipeline - Andreas Kretz
We talked about:
- Andreas’s background
- Why data engineering is becoming more popular
- Who to hire first – a data engineer or a data scientist?
- How can I, as a data scientist, learn to build pipelines?
- Don’t use too many tools
- What is a data pipeline and why do we need it?
- What is ingestion?
- Can just one person build a data pipeline?
- Approaches to building data pipelines for data scientists
- Processing frameworks
- Common setup for data pipelines — car price prediction
- Productionizing the model with the help of a data pipeline
- Scheduling
- Orchestration
- Start simple
- Learning DevOps to implement data pipelines
- How to choose the right tool
- Are Hadoop, Docker, Cloud necessary for a first job/internship?
- Is Hadoop still relevant or necessary?
- Data engineering academy
- How to pick up Cloud skills
- Avoid huge datasets when learning
- Convincing your employer to do data science
- How to find Andreas
Links:
- LinkedIn: https://www.linkedin.com/in/andreas-kretz
- Data engieering cookbook: https://cookbook.learndataengineering.com/
- Course: https://learndataengineering.com/
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
Next Episode

I Want to Build a Machine Learning Startup! - Elena Samuylova
We talked about:
- Elena’s background
- Why do a startup instead of being an employee?
- Where to get ideas for your startup
- Finding a co-founder
- What should you consider before starting a startup?
- Vertical startup vs infrastructure startup
- ‘AI First’ startups
- Building tools for engineers
- What skills do you need to start a startup?
- Startup risks
- How to be prepared to fail
- Work-life balance
- The part-time startup approach
- Startup investment models
- No resources and no technical expertise – what to do?
- Productionizing your services
- When to hire an expert
- Talking to people with a problem before solving the problem
- Starting Elena’s startup, Evidently
- Elena’s role at Evidently
- Why is Evidently open source?
- “People will just copy my open source code. Should I be concerned?”
- Bottom-up adoption
- Creating value so that clients engage with your product
- Is there a difference between countries when creating a startup?
- Does open source mean the data is safer?
- When should you hire engineers?
- Following the market
- Startups out of genuine interest vs Just for money and for fun
Links:
- EvidentlyAI: https://evidentlyai.com/
- Elena's LinkedIn: https://www.linkedin.com/in/elenasamuylova/
- Elena's Twitter: https://twitter.com/elenasamuylova/
Join DataTalks.Club: https://datatalks.club/slack.html
Our events: https://datatalks.club/events.html
If you like this episode you’ll love
Episode Comments
Generate a badge
Get a badge for your website that links back to this episode
<a href="https://goodpods.com/podcasts/datatalksclub-172386/big-data-engineer-vs-data-scientist-roksolana-diachuk-15473859"> <img src="https://storage.googleapis.com/goodpods-images-bucket/badges/generic-badge-1.svg" alt="listen to big data engineer vs data scientist - roksolana diachuk on goodpods" style="width: 225px" /> </a>
Copy