by Michael Gou | May 13, 2025
Forecasting across hierarchical levels — for example, national, retailer, and category — is often inconsistent and inaccurate when done independently. We show, via a CPG case study, how applying hierarchical reconciliation methods with Tickr’s Forecasting Platform improves forecasting accuracy by an average of 14.9% enabling better business decisions.
by Michael Gou | Jul 23, 2024
This post addresses the challenge of identifying substitutable goods in consumer behavior analysis by leveraging large-scale shopping cart data. We introduce a sequential probabilistic model called “SHOPPER” developed by Ruiz, Athey, and Blei, to analyze hundreds of thousands of shopping trips encompassing millions of transactions. Our approach captures the complexity of consumer purchase behavior, considering various factors such as shopping purpose, seasonal variations, in-store promotions, and personal preferences.
by Michael Gou | Jul 23, 2024
In a previous post, substitutable goods were identified by leveraging large-scale shopping cart data to estimate a sequential probabilistic model called “SHOPPER,” developed by Ruiz, Athey, and Blei. In this post, the analysis is expanded by analyzing shopping cart data from several retailers across various geographical regions. Similar products are queried to investigate how the ranked lists of substitutable products vary across different retailers and geographies. These results provide insights into what products are substituted at various retailers and for the same retailers in different geographies.
by Destiny Ziebol | Jul 1, 2024
By creating a retrieval augmented LLM pipeline, (RAG with OpenAI’s GPT-4o), we explored the capability of LLMs to ‘twin’ real respondents in a publicly available survey. By providing the model context of previously answered questions from a particular respondent, we were able to achieve far better than random accuracy at masked questions. This opens the door to utilizing LLMs to augment known sentiment, extending surveys beyond the original question set and providing a far more flexible framework for working with qualitative data from real respondents and their ‘digital twins’.