Measuring and Modeling the Label Dynamics of Online Anti-Malware Engines

12/08/2020

Shuofei Zhu, Jianjun Shi, Limin Yang, Boqin Qin, Ziyi Zhang, Linhai Song, Gang Wang

Abstract: VirusTotal provides malware labels from a large set of anti-malware engines, and is heavily used by researchers for malware annotation and system evaluation. Since different engines often disagree with each other, researchers have used various methods to aggregate their labels. In this paper, we take a data-driven approach to categorize, reason about, and validate common labeling methods used by researchers. We first survey 115 academic papers that use VirusTotal, and identify common methodologies. Then we collect the daily snapshots of VirusTotal labels for more than 14,000 files (including a subset of manually verified ground truth) from 65 VirusTotal engines over a year. Our analysis validates the benefits of threshold-based label aggregation in stabilizing files’ labels, and also points out the impact of poorly chosen thresholds. We show that hand-picked “trusted” engines do not always perform well, and that certain groups of engines are strongly correlated and should not be treated independently. Finally, we empirically show that certain engines fail to perform in-depth analysis on submitted files and can easily produce false positives. Based on our findings, we offer suggestions for future usage of VirusTotal for data annotation.
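
As an illustration of the threshold-based label aggregation mentioned in the abstract, the following is a minimal Python sketch, not the authors' implementation. It assumes each daily snapshot is a mapping from engine name to a boolean "flagged as malicious" verdict; the engine names, the function names, and the sample data are all hypothetical.

```python
from typing import Mapping, Sequence


def aggregate_label(engine_labels: Mapping[str, bool], threshold: int) -> bool:
    """Label a file malicious when at least `threshold` engines flag it."""
    if threshold < 1:
        raise ValueError("threshold must be at least 1")
    positives = sum(1 for flagged in engine_labels.values() if flagged)
    return positives >= threshold


def count_flips(daily_labels: Sequence[bool]) -> int:
    """Count how often the aggregated label changes between consecutive days."""
    return sum(1 for prev, cur in zip(daily_labels, daily_labels[1:]) if prev != cur)


# Hypothetical daily snapshots for a single file (made-up engines and verdicts).
snapshots = [
    {"engine_a": True, "engine_b": False, "engine_c": False},
    {"engine_a": False, "engine_b": False, "engine_c": False},
    {"engine_a": True, "engine_b": True, "engine_c": False},
]

for t in (1, 2):
    labels = [aggregate_label(day, t) for day in snapshots]
    print(f"threshold={t}: labels={labels}, flips={count_flips(labels)}")
```

On this toy data, a threshold of 2 yields fewer label flips than a threshold of 1. That is a property of the made-up snapshots only and says nothing about which threshold is appropriate in practice.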

USENIX_Security

The talk and the corresponding paper were published at the USENIX Security 2020 virtual conference.

Similar Papers

29/06/2020

On the prevalence, impact, and evolution of SQL code smells in data-intensive systems

Biruk Asmare Muse, Mohammad Masudur Rahman, Csaba Nagy, Anthony Cleve, Foutse Khomh, Giuliano Antoniol

Keywords: data-intensive systems, SQL code smells, Code smells, database access

14:53

15/11/2020

Taming Type Annotations in Gradual Typing

John Peter Campora, Sheng Chen

Keywords: variational types, gradual typing, cast errors

14:33

06/12/2021

Learning with Labeling Induced Abstentions

Kareem Amin, Giulia DeSalvo, Afshin Rostamizadeh

Keywords: machine learning, active learning

11:22

07/09/2020

Semi-supervised Active Learning for Instance Segmentation via Scoring Predictions

Jun Wang, Shaoguo Wen, Jianghua Yu, Kaixing Chen, Xin Zhou, Peng Gao, Guotong Xie, Changsheng Li

Keywords: instance segmentation, active learning, semi-supervised learning, medical images

7:48

19/04/2021

Hidden biases in unreliable news detection datasets

Xiang Zhou, Heba Elfardy, Christos Christodoulopoulos, Thomas Butler, Mohit Bansal

10:57

29/06/2020

An empirical study on regular expression bugs

Peipei Wang, Chris Brown, Jamie A. Jennings, Kathryn T. Stolee

Keywords: pull requests, Regular expression bug characteristics, bug fixes

12:22

04/07/2020

Improving Truthfulness of Headline Generation

Kazuki Matsumaru, Sho Takase, Naoaki Okazaki

Keywords: Truthfulness Generation, abstractive summarization, headline generation, automatic headlines

11:21

06/12/2021

DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples

Yi Xu, Jiandong Ding, Lu Zhang, Shuigeng Zhou

Keywords: deep learning, machine learning, semi-supervised learning

10:11

02/02/2021

LightXML: Transformer with Dynamic Negative Sampling for High-Performance Extreme Multi-label Text Classification

Ting Jiang, Deqing Wang, Leilei Sun, Huayi Yang, Zhengyang Zhao, Fuzhen Zhuang

16:28

07/09/2020

Weakly-supervised Salient Instance Detection

Xin Tian, Ke Xu, Xin Yang, Baocai Yin, Rynson Lau

Keywords: Salient Instance Detection, SID, weak supervision, saliency detection, subitizing

8:22

12/08/2020

On Training Robust PDF Malware Classifiers

Yizheng Chen, Shiqi Wang, Dongdong She, Suman Jana

12:21

06/12/2020

Disentangling Human Error from Ground Truth in Segmentation of Medical Images

Le Zhang, Ryu Tanno, Moucheng Xu, Chen Jin, Joseph Jacob, Olga Cicarrelli, Frederik Barkhof, Daniel Alexander

3:21

22/06/2020

Enriching Knowledge Bases with Interesting Negative Statements

Hiba Arnaout, Simon Razniewski, Gerhard Weikum

Keywords: information retrieval, knowledge bases, ranking, negation

5:25

12/07/2020

Extreme Multi-label Classification from Aggregated Labels

Yanyao Shen, Hsiang-Fu Yu, Sujay Sanghavi, Inderjit Dhillon

Keywords: Optimization - Large Scale, Parallel and Distributed

15:05

29/06/2020

Investigating severity thresholds for test smells

Davide Spadini, Martin Schvarcbacher, Ana-Maria Oprescu, Magiel Bruntink, Alberto Bacchelli

Keywords: Test Smells, Software Testing, Empirical Software Engineering

13:26

12/08/2020

Everything Old is New Again: Binary Security of WebAssembly

Daniel Lehmann, Johannes Kinder, Michael Pradel

12:11

18/07/2021

Delving into Deep Imbalanced Regression

Yuzhe Yang, Kaiwen Zha, Yingcong Chen, Hao Wang, Dina Katabi

Keywords: Applications

16:37

22/09/2020

Are we evaluating rigorously? Benchmarking recommendation for reproducible evaluation and fair comparison

Zhu Sun, Di Yu, Hui Fang, Jie Yang, Xinghua Qu, Jie Zhang, Cong Geng

Keywords: Benchmarks, Recommender Systems, Reproducible Evaluation

2:43

15/11/2020

Testing Consensus Implementations using Communication Closure

Cezara Drăgoi, Constantin Enea, Burcu Kulahcioglu Ozkan, Rupak Majumdar, Filip Niksic

Keywords: Distributed consensus, Communication closure, Randomized testing

15:19

04/07/2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization

Yuan Zang, Fanchao Qi, Chenghao Yang, Zhiyuan Liu, Meng Zhang, Qun Liu, Maosong Sun

Keywords: Textual attacking, Word-level attacking, combinatorial problem

9:34

15/06/2020

Can Applications Recover from fsync Failures?

Anthony Rebello, Yuvraj Patel, Ramnatthan Alagappan, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau

23:04

12/08/2020

MUZZ: Thread-aware Grey-box Fuzzing for Effective Bug Hunting in Multithreaded Programs

Hongxu Chen, Shengjian Guo, Yinxing Xue, Yulei Sui, Cen Zhang, Yuekang Li, Haijun Wang, Yang Liu

10:59

04/07/2020

Beyond Accuracy: Behavioral Testing of NLP Models with CheckList

Marco Tulio Ribeiro, Tongshuang Wu, Carlos Guestrin, Sameer Singh

Keywords: measuring accuracy, generalization, behavioral testing, software engineering

11:40

19/08/2021

Beyond Accuracy: Behavioral Testing of NLP Models with Checklist (Extended Abstract)

Marco Tulio Ribeiro, Tongshuang Wu, Carlos Guestrin, Sameer Singh

Keywords: Natural Language Processing, Resources and Evaluation, NLP Applications and Tools, Text Classification, Question Answering

14:26

22/09/2020

Revisiting adversarially learned injection attacks against recommender systems

Jiaxi Tang, Hongyi Wen, Ke Wang

Keywords: Recommender System, Security and Privacy, Adversarial Machine Learning

2:13

14/06/2020

Transferable, Controllable, and Inconspicuous Adversarial Attacks on Person Re-identification With Deep Mis-Ranking

Hongjun Wang, Guangrun Wang, Ya Li, Dongyu Zhang, Liang Lin

Keywords: robustness of person reidentification, security of reid system, adversarial attack

4:49

25/07/2020

A general knowledge distillation framework for counterfactual recommendation via uniform data

Dugang Liu, Pengxiang Cheng, Zhenhua Dong, Xiuqiang He, Weike Pan, Zhong Ming

Keywords: counterfactual learning, uniform data, recommender systems, knowledge distillation

14:06

15/11/2020

Actor Concurrency Bugs: A Comprehensive Study on Symptoms, Root Causes, API Usages, and Differences

Mehdi Bagherzadeh, Nicholas Fireman, Anas Shawesh, Raffi Khatchadourian

Keywords: GitHub, Akka actor bugs, Stack Overflow, Actor bug API usages, Actor bug root causes, Actor bug symptoms, Actor bug differences

18:19

12/08/2020

MVP: Detecting Vulnerabilities using Patch-Enhanced Vulnerability Signatures

Yang Xiao, Bihuan Chen, Chendong Yu, Zhengzi Xu, Zimu Yuan, Feng Li, Binghong Liu, Yang Liu, Wei Huo, Wei Zou, Wenchang Shi

11:35

06/12/2021

Adversarial Examples Make Strong Poisons

Liam Fowl, Micah Goldblum, Ping-yeh Chiang, Jonas Geiping, Wojciech Czaja, Tom Goldstein

Keywords: machine learning, robustness, adversarial robustness and security

14:05

04/11/2020

Automated Reasoning and Detection of Specious Configuration in Large Systems with Symbolic Execution

Yigong Hu, Gongqi Huang, Peng Huang

18:44

19/04/2021

Expanding, retrieving and infilling: Diversifying cross-domain question generation with flexible templates

Xiaojing Yu, Anxiao Jiang

11:40

16/11/2020

SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup

Rongzhi Zhang, Yue Yu, Chao Zhang

Keywords: low-resource tasks, active labeling, mixup, sequence mixup

11:16

02/02/2021

Curse or Redemption? How Data Heterogeneity Affects the Robustness of Federated Learning

Syed Zawad, Ahsan Ali, Pin-Yu Chen, Ali Anwar, Yi Zhou, Nathalie Baracaldo, Yuan Tian, Feng Yan

19:26

07/06/2020

Toward a Better Performance Evaluation Framework for Fake News Classification

Lia Bozarth, Ceren Budak

Keywords: bias, classification, classifiers, communities, fake, fake news, impact, news, performance, sites, topic

9:54

02/02/2021

Diagnose Like A Pathologist: Weakly-Supervised Pathologist-Tree Network for Slide-Level Immunohistochemical Scoring

Zhen Chen, Jun Zhang, Shuanlong Che, Junzhou Huang, Xiao Han, Yixuan Yuan

13:07

02/02/2021

EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation

Qi Zhou, Haipeng Chen, Yitao Zheng, Zhen Wang

19:28

06/12/2020

No Subclass Left Behind: Fine-Grained Robustness in Coarse-Grained Classification Problems

Nimit Sohoni, Jared Dunnmon, Geoffrey Angus, Albert Gu, Christopher Ré

3:18

15/11/2020

Learning Semantic Program Embeddings with Graph Interval Neural Network

Yu Wang, Ke Wang, Fengjuan Gao, Linzhang Wang

Keywords: Intervals, Control-flow graphs, Null pointer dereference detection, Graph neural networks, Program embeddings

10:36

26/04/2020

Measuring the Reliability of Reinforcement Learning Algorithms

Stephanie C.Y. Chan, Samuel Fishman, Anoop Korattikara, John Canny, Sergio Guadarrama

Keywords: reinforcement learning, metrics, statistics, reliability

5:32