Category-specific CNN for visual-aware CTR prediction at JD.com

Abstract: As one of the largest B2C e-commerce platforms in China, JD.com also powers a leading advertising system, serving millions of advertisers with fingertip connection to hundreds of millions of customers. In our system, as well as most e-commerce scenarios, ads are displayed with images. This makes visual-aware Click Through Rate (CTR) prediction of crucial importance to both business effectiveness and user experience. Existing algorithms usually extract visual features using off-the-shelf Convolutional Neural Networks (CNNs) and late fuse the visual and non-visual features for the finally predicted CTR. Despite being extensively studied, this field still face two key challenges. First, although encouraging progress has been made in offline studies, applying CNNs in real systems remains non-trivial, due to the strict requirements for efficient end-to-end training and low-latency online serving. Second, the off-the-shelf CNNs and late fusion architectures are suboptimal. Specifically, off-the-shelf CNNs were designed for classification thus never take categories as input features. While in e-commerce, categories are precisely labeled and contain abundant visual priors that will help the visual modeling. Unaware of the ad category, these CNNs may extract some unnecessary category-unrelated features, wasting CNN’s limited expression ability. To overcome the two challenges, we propose Category-specific CNN (CSCNN) specially for CTR prediction. CSCNN early incorporates the category knowledge with a light-weighted attention-module on each convolutional layer. This enables CSCNN to extract expressive category-specific visual patterns that benefit the CTR prediction. Offline experiments on benchmark and a 10 billion scale real production dataset from JD, together with an Online A/B test show that CSCNN outperforms all compared state-of-the-art algorithms. We also build a highly efficient infrastructure to accomplish end-to-end training with CNN on the 10 billion scale real production dataset within 24 hours, and meet the low latency requirements of online system (20ms on CPU). CSCNN is now deployed in the search advertising system of JD, serving the main traffic of hundreds of millions of active users.

06/12/2020

Kalman Filtering Attention for User Behavior Modeling in CTR Prediction

Hu Liu, Jing LU, Xiwei Zhao and
Sulong Xu, Hao Peng, Yutong Liu, Zehua Zhang, Jian Li, Junsheng Jin, Yongjun Bao, Weipeng Yan

convolutional neural networks, neural network transparency, AI explainability, deep Taylor decomposition, supervised classification, zebrafish, transparency, behavioral research, optical flow

4:12

14/06/2020

landmark detection, pose estimation, faces, architectures, high resolution, deep learning, digital humans, performance capture

1:01

03/05/2021

deraining, rainy image, rain residue, semi-supervision learning, gaussian processes, labeled data, unlabeled data, pseudo-ground truth, synthetic data, real-world data.

5:01

06/12/2020

DVERGE: Diversifying Vulnerabilities for Enhanced Robust Generation of Ensembles

Huanrui Yang, Jingyang Zhang, Hongliang Dong and
Nathan Inkawhich, Andrew Gardner, Andrew Touchet, Wesley Wilkes, Heath Berry, Helen Li

Adversarial Examples, Adversarial Perturbation, Adversarial Robustness, Robustness, Adversarial Transferability, Vision Transformers, ViT, MLP-Mixer, CNN

3:17

02/02/2021

Shengyu Zhang, Ziqi Tan, Zhou Zhao and
Jin Yu, Kun Kuang, Tan Jiang, Jingren Zhou, Hongxia Yang, Fei Wu

accounts, classification, clusters, detection, representations, similarity, spaces, stance, terms, tweets, twitter

9:44

02/02/2021

high-dynamic-range imaging, point-spread-function engineering, end-to-end learning, computational imaging, deep learning, optics, photography

5:01

22/11/2021

MVT: Multi-view Vision Transformer for 3D Object Recognition

Shuo Chen, Tan Yu, Ping Li

channel attention, efficient, adaptive 1d convolution, deep cnns, image classifcation, object detection, instance segmentation

0:57

14/06/2020

online learning, visual tracking, continual learning, recursive least-squares estimation, deep learning, memory retention, recursive learning, mini-batch sgd, normal equation, mlp layer

5:01

06/12/2021

Xiaotian Hao, Zhaoqing Peng, Yi Ma and
Guan Wang, Junqi Jin, Jianye Hao, Shan Chen, Rongquan Bai, Mingzhou Xie, Miao Xu, Zhenzhe Zheng, Chuan Yu, HAN LI, Jian Xu, Kun Gai

Hao Tang, Xingwei Liu, Kun Han and
Xiaohui Xie, Xuming Chen, Huang Qian, Yong Liu, Shanlin Sun, Narisu Bai

adversarial defense, feature denoise, multi-task learning, self-supervised learning, image restoration, lipschitz constant constraint, feature pyramid decoder

1:01

05/04/2021