Log in

goodpods headphones icon

To access all our features

Open the Goodpods app
Close icon
DataTalks.Club - Navigating Challenges and Innovations in Search Technologies - Atita Arora

Navigating Challenges and Innovations in Search Technologies - Atita Arora

12/27/23 • 57 min

DataTalks.Club

We talked about:

  • Atita’s background
  • How NLP relates to search
  • Atita’s experience with Lucidworks and OpenSource Connections
  • Atita’s experience with Qdrant and vector databases
  • Utilizing vector search
  • Major changes to search Atita has noticed throughout her career
  • RAG (Retrieval-Augmented Generation)
  • Building a chatbot out of transcripts with LLMs
  • Ingesting the data and evaluating the results
  • Keeping humans in the loop
  • Application of vector databases for machine learning
  • Collaborative filtering
  • Atita’s resource recommendations

Links:

  • LinkedIn: https://www.linkedin.com/in/atitaarora/
  • Twitter: https://x.com/atitaarora
  • Github: https://github.com/atarora
  • Human-in-the-Loop Machine Learning: https://www.manning.com/books/human-in-the-loop-machine-learning
  • Relevant Search: https://www.manning.com/books/relevant-search
  • Let's learn about Vectors: https://hub.superlinked.com/ Langchain: https://python.langchain.com/docs/get_started/introduction
  • Qdrant blog: https://blog.qdrant.tech/
  • OpenSource Connections Blog: https://opensourceconnections.com/blog/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

plus icon
bookmark

We talked about:

  • Atita’s background
  • How NLP relates to search
  • Atita’s experience with Lucidworks and OpenSource Connections
  • Atita’s experience with Qdrant and vector databases
  • Utilizing vector search
  • Major changes to search Atita has noticed throughout her career
  • RAG (Retrieval-Augmented Generation)
  • Building a chatbot out of transcripts with LLMs
  • Ingesting the data and evaluating the results
  • Keeping humans in the loop
  • Application of vector databases for machine learning
  • Collaborative filtering
  • Atita’s resource recommendations

Links:

  • LinkedIn: https://www.linkedin.com/in/atitaarora/
  • Twitter: https://x.com/atitaarora
  • Github: https://github.com/atarora
  • Human-in-the-Loop Machine Learning: https://www.manning.com/books/human-in-the-loop-machine-learning
  • Relevant Search: https://www.manning.com/books/relevant-search
  • Let's learn about Vectors: https://hub.superlinked.com/ Langchain: https://python.langchain.com/docs/get_started/introduction
  • Qdrant blog: https://blog.qdrant.tech/
  • OpenSource Connections Blog: https://opensourceconnections.com/blog/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Previous Episode

undefined - The Entrepreneurship Journey: From Freelancing to Starting a Company - Adrian Brudaru

The Entrepreneurship Journey: From Freelancing to Starting a Company - Adrian Brudaru

We talked about:

  • Adrian’s background
  • The benefits of freelancing
  • Having an agency vs freelancing
  • What let Adrian switch over from freelancing
  • The conception of DLT (Growth Full Stack)
  • The investment required to start a company
  • Growth through the provision of services
  • Growth through teaching (product-market fit)
  • Moving on to creating docs
  • Adrian’s current role
  • Strategic partnerships and community growth through DocDB
  • Plans for the future of DLT
  • DLT vs Airbyte vs Fivetran
  • Adrian’s resource recommendations

Links:

  • Adrian's LinkedIn: https://www.linkedin.com/in/data-team/
  • Twitter: https://twitter.com/dlt_library
  • Github: https://github.com/dlt-hub/dlt
  • Website: https://dlthub.com/docs/intro

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Next Episode

undefined - Bayesian Modeling and Probabilistic Programming - Rob Zinkov

Bayesian Modeling and Probabilistic Programming - Rob Zinkov

We talked about:

  • Rob’s background
  • Going from software engineering to Bayesian modeling
  • Frequentist vs Bayesian modeling approach
  • About integrals
  • Probabilistic programming and samplers
  • MCMC and Hakaru
  • Language vs library
  • Encoding dependencies and relationships into a model
  • Stan, HMC (Hamiltonian Monte Carlo) , and NUTS
  • Sources for learning about Bayesian modeling
  • Reaching out to Rob

Links:

  • Book 1: https://bayesiancomputationbook.com/welcome.html
  • Book/Course: https://xcelab.net/rm/statistical-rethinking/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Episode Comments

Generate a badge

Get a badge for your website that links back to this episode

Select type & size
Open dropdown icon
share badge image

<a href="https://goodpods.com/podcasts/datatalksclub-172386/navigating-challenges-and-innovations-in-search-technologies-atita-aro-40185365"> <img src="https://storage.googleapis.com/goodpods-images-bucket/badges/generic-badge-1.svg" alt="listen to navigating challenges and innovations in search technologies - atita arora on goodpods" style="width: 225px" /> </a>

Copy