The enterprise platform to build, deliver, and govern AI
Watch the 15 minute on-demand demo to get an overview of the Domino Enterprise AI Platform.
This Domino Data Science Field Note provides highlights and video clips from Addhyan Pandey’s Domino Data Pop-Up talk, “Leveraging Data Science in the Automotive Industry.” Addhyan Pandey is the Principal Data Scientist at Cars.com. Highlights covered in this blog post include Pandey using word2vec to identify duplicate vehicles on the platform, how his data science team refers to predictive models as “data products”, and the company’s overall approach to data science.
Addhyan Pandey, Principal Data Scientist at Cars.com, explored how data science is implemented across the company and how data science is more than machine learning in his talk, “Leveraging Data Science in the Automotive Industry.” Pandey covers how
Pandey launched his talk with discussing how Cars.com revenue streams include subscription and advertising models. As a result, the problems, or questions, that the data science team tries to solve, go beyond connecting buyers and sellers. The objective is to “manage the overall lifecycle” of a car with multiple points of engagement with both buyers and sellers within the e-commerce marketplace. Pandey points out that a recommendation system, particularly its relevance, is a core value proposition for ensuring that users continue to engage with an e-commerce marketplace.
Pandey noted that when he started at Cars.com, he saw “a huge bunch of text” for the sellers notes and “if I were purchasing a car, I would never ever read that because I’m trained on a website that gives me precise information…." even to figure out “two similar vehicles, it was really tough.” This is a relevant recommendations problem. Pandey decided to address this problem by “putting all of those words in a vector space….aggregate all of those words together in that vector space and have a vector for a particular vehicle.” This allows data scientists to “compute the cosine similarity between the two.” Also, identifying duplicates vehicles on the platform was an added benefit of using word2vec.
Pandey also noted in the talk that “when you talk about data science, it's not really fair to just talk about a specific model… If I just build a model and don't really know how to implement it, I'm not doing justice to the entire system. We call all our predictive models as data products.” He discussed three data products that include
Towards the end of the talk, Pandey discussed how his perspective of data science has changed over the years. In 2011, he saw data science as being a predictive model. Then as he worked with more teams, he expanded his perspective. In 2017, he indicates that the “whole picture” of data science is “more complicated” and that data science teams need to “build a product that is robust…build a model that is scalable, that’s accurate, and has very low computational time.”

Domino Data Lab empowers the largest AI-driven enterprises to build and operate AI at scale. Domino’s Enterprise AI Platform provides an integrated experience encompassing model development, MLOps, collaboration, and governance. With Domino, global enterprises can develop better medicines, grow more productive crops, develop more competitive products, and more. Founded in 2013, Domino is backed by Sequoia Capital, Coatue Management, NVIDIA, Snowflake, and other leading investors.
Watch the 15 minute on-demand demo to get an overview of the Domino Enterprise AI Platform.
Watch the 15 minute on-demand demo to get an overview of the Domino Enterprise AI Platform.