Product Classification API Part 2: Data Preparation

This post is part 2 of the series on building a product classification API. A demo of the API is available here: datagene.io/categorize_web. Part 1 is available here; Part 3 is available here.

In part 1, we focused on data acquisition and formatting the categories. Here, we’ll focus on preparing the product titles (and, optionally, the short descriptions) before training our model.



Sharing about my work in Lazada at Strata + Hadoop 2016

Recently, I had the opportunity to present part of my work at Lazada: ranking products in catalog and search results to improve customer experience and conversion. Conference session details are available here.

Here’s the deck presented. Any feedback on how we can improve our ranking framework, or how I can improve my presentation, is welcome.