TY - GEN
T1 - Approximation Algorithms for Submodular Data Summarization with a Knapsack Constraint
AU - Han, Kai
AU - Cui, Shuang
AU - Zhu, Tianshuai
AU - Zhang, Enpei
AU - Wu, Benwei
AU - Yin, Zhizhuo
AU - Xu, Tong
AU - Tang, Shaojie
AU - Huang, He
N1 - Publisher Copyright: © 2021 Owner/Author.
PY - 2021/5/31
Y1 - 2021/5/31
N2 - Data summarization, a fundamental methodology aimed at selecting a representative subset of data elements from a large pool of ground data, has found numerous applications in big data processing, such as social network analysis [5, 7], crowdsourcing [6], clustering [4], network design [13], and document/corpus summarization [14]. Moreover, it is well acknowledged that the "representativeness" of a dataset in data summarization applications can often be modeled by submodularity, a mathematical concept abstracting the "diminishing returns" property in the real world. Therefore, a lot of studies have cast data summarization as a submodular function maximization problem (e.g., [2]).
AB - Data summarization, a fundamental methodology aimed at selecting a representative subset of data elements from a large pool of ground data, has found numerous applications in big data processing, such as social network analysis [5, 7], crowdsourcing [6], clustering [4], network design [13], and document/corpus summarization [14]. Moreover, it is well acknowledged that the "representativeness" of a dataset in data summarization applications can often be modeled by submodularity, a mathematical concept abstracting the "diminishing returns" property in the real world. Therefore, a lot of studies have cast data summarization as a submodular function maximization problem (e.g., [2]).
KW - data summarization
KW - machine learning
KW - submodular function maximization
UR - https://www.scopus.com/pages/publications/85108566082
U2 - 10.1145/3410220.3453922
DO - 10.1145/3410220.3453922
M3 - Conference contribution
T3 - SIGMETRICS 2021 - Abstract Proceedings of the 2021 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems
SP - 65
EP - 66
BT - SIGMETRICS 2021 - Abstract Proceedings of the 2021 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems
PB - Association for Computing Machinery, Inc
T2 - 2021 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS 2021
Y2 - 14 June 2021 through 18 June 2021
ER -