BERT-based similarity learning for product matching

08/12/2020

BERT-based similarity learning for product matching

Janusz Tracz, Piotr Iwo Wójcik, Kalina Jasinska-Kobus, Riccardo Belluzzo, Robert Mroczkowski, Ireneusz Gawlik

Keywords:

Abstract Paper Similar Papers

Abstract: Product matching, i.e., being able to infer the product being sold for a merchant-created offer, is crucial for any e-commerce marketplace, enabling product-based navigation, price comparisons, product reviews, etc. This problem proves a challenging task, mostly due to the extent of product catalog, data heterogeneity, missing product representants, and varying levels of data quality. Moreover, new products are being introduced every day, making it difficult to cast the problem as a classification task. In this work, we apply BERT-based models in a similarity learning setup to solve the product matching problem. We provide a thorough ablation study, showing the impact of architecture and training objective choices. Application of transformer-based architectures and proper sampling techniques significantly boosts performance for a range of e-commerce domains, allowing for production deployment.

The video of this talk cannot be embedded. You can watch it here:

https://underline.io/lecture/6605-bert-based-similarity-learning-for-product-matching

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at COLING Workshops 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

01/07/2020

Deep Learning-based Online Alternative Product Recommendations at Scale

Mingming Guo, Nian Yan, Xiquan Cui and
San He Wu, Unaiza Ahsan, Rebecca West, Khalifeh Al Jadda

Keywords Paper

0

0

0

0

18:02

19/10/2020

Gated heterogeneous graph representation learning for shop search in e-commerce

Xichuan Niu, Bofang Li, Chenliang Li and
Rong Xiao, Haochuan Sun, Honggang Wang, Hongbo Deng, Zhenzhong Chen

Keywords Paper

e-commerce, gated mechanism, heterogeneous graph, shop search

0

0

0

0

5:19

02/02/2021

Traffic Shaping in E-Commercial Search Engine: Multi-Objective Online Welfare Maximization

Liucheng Sun, Chenwei Weng, Chengfu Huo and
Weijun Ren, Guochuan Zhang, Xin Li

Keywords Paper

0

0

0

0

13:50

19/10/2020

MTBRN: Multiplex target-behavior relation enhanced network for click-through rate prediction

Yufei Feng, Fuyu Lv, Binbin Hu and
Fei Sun, Kun Kuang, Yang Liu, Qingwen Liu, Wenwu Ou

Keywords Paper

click-through rate prediction, recommender system

0

0

0

0

10:08

19/10/2020

Learning to profile: User meta-profile network for few-shot learning

Hao Gong, Qifang Zhao, Tianyu Li and
Derek Cho, DuyKhuong Nguyen

Keywords Paper

multi-task learning, multi-modal model, representation learning, meta-learning

0

0

0

1

12:10

02/06/2020

StreamPipes Connect: Semantics-Based Edge Adapters for the IIoT

Philipp Zehnder, Patrick Wiener, Tim Straub, Dominik Riemer

Keywords Paper

0

0

0

0

29:30

22/09/2020

Query as context for item-to-item recommendation

Moumita Bhattacharya, Amey Barapatre

Keywords Paper

0

0

0

0

3:46

05/12/2020

Answering product-related questions with heterogeneous information

Wenxuan Zhang, Qian Yu, Wai Lam

Keywords Paper

0

0

0

0

13:53

01/07/2020

Using Large Pretrained Language Models for Answering User Queries from Product Specifications

Kalyani Roy, Smit Shah, Nithish Pai and
Jaidam Ramtej, Prajit Nadkarni, Jyotirmoy Banerjee, Pawan Goyal, Surender Kumar

Keywords Paper

0

0

0

0

15:44

03/05/2021

Latent Skill Planning for Exploration and Transfer

Kevin Xie, Homanga Bharadhwaj, Danijar Hafner and
Animesh Garg, Florian Shkurti

Keywords Paper

Partial Amortization, Model Predictive Control, Planning, Mutual Information, Skill Discovery, World Models, Model-Based Reinforcement Learning

0

0

0

0

5:10

19/10/2020

ADMSCN: A novel perspective for user intent prediction in customer service bots

Kuan Xu, Chilin Fu, Xiaolu Zhang and
Cen Chen, Ya-Lin Zhang, Wenge Rong, Zujie Wen, Jun Zhou, Xiaolong Li, Yu Qiao

Keywords Paper

recommender system, multiple instancelearning, user intent prediction

0

0

0

0

8:51

14/09/2020

Why did my Consumer Shop? Learning an Efficient Distance Metric for Retailer Transaction Data

Yorick Spenrath, Marwan Hassani, Boudewijn van Dongen, Haseeb Tariq

Keywords Paper

distance metric, transaction categorization, clustering, optimization

0

0

0

0

15:14

19/10/2020

AliMeKG: Domain knowledge graph construction and application in e-commerce

Feng-Lin Li, Hehong Chen, Guohai Xu and
Tian Qiu, Feng Ji, Ji Zhang, Haiqing Chen

Keywords Paper

e-commerce, pre-sales customer service, domain knowledge graph

0

0

0

0

6:46

25/07/2020

Evolutionary product description generation: A dynamic fine-tuning approach leveraging user click behavior

Yongzhen Wang, Jian Wang, Heng Huang and
Hongsong Li, Xiaozhong Liu

Keywords Paper

product description generation, neural network, sequence-to-sequence, click-through rate, reinforcement learning

0

0

0

0

14:34

23/08/2020

Learning transferrable parameters for long-tailed sequential user behavior modeling

Jianwen Yin, Chenghao Liu, Weiqing Wang and
Jianling Sun, Steven C. H. Hoi

Keywords Paper

long-tailed distribution, adversarial training, gradient alignment, sequential user behavior modeling

0

0

0

0

8:21

04/07/2020

Hiring Now: A Skill-Aware Multi-Attention Model for Job Posting Generation

Liting Liu, Jie Liu, Wenzheng Zhang and
Ziming Chi, Wenxuan Shi, Yalou Huang

Keywords Paper

Job Generation, recruiting process, conditional problem, Skill-Aware Model

0

0

0

0

9:19

16/11/2020

AnswerFact: Fact Checking in Product Question Answering

Wenxuan Zhang, Yang Deng, Jing Ma, Wai Lam

Keywords Paper

online shopping, evidence-based tasks, answer problem, product-related platforms

0

0

0

0

12:15

19/10/2020

Deep multifaceted transformers for multi-objective ranking in large-scale e-commerce recommender systems

Yulong Gu, Zhuoye Ding, Shuaiqiang Wang and
Lixin Zou, Yiding Liu, Dawei Yin

Keywords Paper

click-through rate prediction, conversation rate prediction, recommender systems, e-commerce, multi-task learning

0

0

0

0

10:34

19/08/2021

Controlling Fairness and Bias in Dynamic Learning-to-Rank (Extended Abstract)

Marco Morik, Ashudeep Singh, Jessica Hong, Thorsten Joachims

Keywords Paper

Machine Learning, Learning Preferences or Rankings, Fairness, Information Retrieval, Online Learning

0

0

0

0

14:01

25/07/2020

Towards personalized and semantic retrieval: An end-to-end solution for e-commerce search via embedding learning

Han Zhang, Songlin Wang, Kang Zhang and
Zhiling Tang, Yunjiang Jiang, Yun Xiao, Weipeng Yan, Wen-Yun Yang

Keywords Paper

semantic matching, search, neural networks

0

0

0

0

16:13

01/07/2020

How to Grow a (Product) Tree: Personalized Category Suggestions for eCommerce Type-Ahead

Jacopo Tagliabue, Bingqing Yu, Marie Beaulieu

Keywords Paper

0

0

0

0

14:56

16/11/2020

Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product

Tiangang Zhu, Yue Wang, Haoran Li and
Youzheng Wu, Xiaodong He, Bowen Zhou

Keywords Paper

e-commerce scenarios, product retrieval, attribute tasks, multimodal method

0

0

0

0

10:29

02/02/2021

Taxonomy Completion via Triplet Matching Network

Jieyu Zhang, Xiangchen Song, Ying Zeng and
Jiaze Chen, Jiaming Shen, Yuning Mao, Lei Li

Keywords Paper

0

0

0

0

19:59

12/07/2020

Robust Pricing in Dynamic Mechanism Design

Yuan Deng, Sébastien Lahaie, Vahab Mirrokni

Keywords Paper

Learning Theory

0

0

0

0

15:48

25/07/2020

Controlling fairness and bias in dynamic learning-to-rank

Marco Morik, Ashudeep Singh, Jessica Hong, Thorsten Joachims

Keywords Paper

learning-to-rank, selection bias, exposure, fairness, ranking, bias

0

0

0

0

13:55

19/10/2020

Deep multi-interest network for click-through rate prediction

Zhibo Xiao, Luwei Yang, Wen Jiang and
Yi Wei, Yi Hu, Hao Wang

Keywords Paper

click-through rate prediction, multi-interest, recommender system

0

0

0

0

6:09

02/02/2021

An LP-Based Approach for Goal Recognition as Planning

Luísa R. A. Santos, Felipe Meneguzzi, Ramon Fraga Pereira, André Grahl Pereira

Keywords Paper

0

0

0

0

19:54

02/02/2021

An End-to-End Solution for Named Entity Recognition in eCommerce Search

Xiang Cheng, Mitchell Bowden, Bhushan Ramesh Bhange and
Priyanka Goyal, Thomas Packer, Faizan Javed

Keywords Paper

0

0

0

0

18:46

18/07/2021

AutoAttend: Automated Attention Representation Search

Chaoyu Guan, Xin Wang, wenwu zhu

Keywords Paper

Algorithms, AutoML

0

0

0

0

4:49

25/07/2020

Answer ranking for product-related questions via multiple semantic relations modeling

Wenxuan Zhang, Yang Deng, Wai Lam

Keywords Paper

product question answering, e-commerce, answer ranking, question answering

0

0

0

0

15:43

25/04/2020

DFSeer: A Visual Analytics Approach to Facilitate Model Selection for Demand Forecasting

Dong Sun, Zezheng Feng, Yuanzhe Chen and
Yong Wang, Jia Zeng, Mingxuan Yuan, Ting-Chuen Pong, Huamin Qu

Keywords Paper

interactive visualization, model selection, product demand forecasting, time series

0

0

0

0

12:22

19/10/2020

Bid shading in the brave new world of first-price auctions

Djordje Gligorijevic, Tian Zhou, Bharatbhushan Shetty and
Brendan Kitts, Shengjun Pan, Junwei Pan, Aaron Flores

Keywords Paper

bid shading, online bidding, factorization machines

0

0

0

0

9:59

19/10/2020

AutoADR: Automatic model design for ad relevance

Yiren Chen, Yaming Yang, Hong Sun and
Yujing Wang, Yu Xu, Wei Shen, Rong Zhou, Yunhai Tong, Jing Bai, Ruofei Zhang

Keywords Paper

neural architecture search, knowledge distillation, ad relevance

0

0

0

0

9:24

06/12/2021

CATs: Cost Aggregation Transformers for Visual Correspondence

Seokju Cho, Sunghwan Hong, Sangryul Jeon and
Yunsung Lee, Kwanghoon Sohn, Seungryong Kim

Keywords Paper

robustness, transformers

0

0

0

0

6:16

23/08/2020

Dynamic heterogeneous graph neural network for real-time event prediction

Wenjuan Luo, Han Zhang, Xiaodi Yang and
Lin Bo, Xiaoqing Yang, Zang Li, Xiaohu Qie, Jieping Ye

Keywords Paper

heterogeneous graph neural networks, dynamic graph embedding, real-time event embedding

0

0

0

0

18:23

23/08/2020

Towards automated neural interaction discovery for click-through rate prediction

Qingquan Song, Dehua Cheng, Hanning Zhou and
Jiyan Yang, Yuandong Tian, Xia Hu

Keywords Paper

neural architecture search, evolutionary algorithm, CTR prediction

0

0

0

0

18:00

02/02/2021

Reinforcement Learning-based Product Delivery Frequency Control

Yang Liu, Zhengxing Chen, Kittipat Virochsiri and
Juan Wang, Jiahao Wu, Feng Liang

Keywords Paper

0

0

0

1

18:34

19/04/2021

CDA: A cost efficient content-based multilingual web document aligner

Thuy Vu, Alessandro Moschitti

Keywords Paper

0

0

0

0

11:04

23/08/2020

Enterprise cooperation and competition analysis with a sign-oriented preference network

Le Dai, Yu Yin, Chuan Qin and
Tong Xu, Xiangnan He, Enhong Chen, Hui Xiong

Keywords Paper

signed network, graph embedding, enterprise analysis

0

0

0

0

12:31

23/08/2020

Learning to extract attribute value from product via question answering: A multi-task approach

Qifan Wang, Li Yang, Bhargav Kanagal and
Sumit Sanghai, D. Sivakumar, Bin Shu, Zac Yu, Jon Elsas

Keywords Paper

question answering, generalization, attribute value extraction

0

0

0

0

17:56