Truth Discovery & Crowdsourcing

Overview

Rapid industrialization, greater affordability of devices, the explosion of mobile networks, cloud based infrastructure and advanced technologies have given rise to incomprehensibly large worlds of information, described as Big Data. One important property of Big Data is its greater variety of sources, i.e., data about the same object can be obtained from various sources. For example, customer information can be found from multiple databases in a company; a patient's medical records may be scattered at different hospitals; a news event can be characterized by text, images, and video; and an activity can be captured by multiple surveillance cameras and live video feeds. Many interesting patterns reside across all heterogeneous data sources available. One solution to the problem of learning from multiple sources is to extract trustworthy information from different sources and integrate their complimentary perspectives to reach a more accurate and robust decision.


His research topics related to Truth discovery includes:

  • Truth discovery approaches with theoretical guarantees
  • Privacy-preserving truth discovery
  • Crowd Sensings

Representative work

"A Joint Maximum Likelihood Estimation Framework for Truth Discovery: A Unified Perspective", with Shiyu Wang, 2022.
IEEE Transactions on Knowledge and Data Engineering.
Truth Discovery Crowdsourcing Joint Maximum Likelihood Estimation Profile Likelihood Estimation Asymptotic Consistency
"Towards differentially private truth discovery for crowd sensing systems", with Yaliang Li, Zhan Qin, Chenglin Miao, Lu Su, Jing Gao Kui Ren and Bolin Ding. 2020.
IEEE International Conference on Distributed Computing Systems (ICDCS).
Truth Discovery Differential Privacy Data Aggregation
"IProWA: A novel probabilistic graphical model for crowdsourcing aggregation", with Tianqi Wang, Fenglong Ma and Jing Gao. 2019.
IEEE International Conference on Big Data (Big Data).
Data Mining Crowdsourcing Item Parameter Estimation Probabilistic Graphic Model
"Privacy-preserving truth discovery in crowd sensing systems", with Chenglin Miao, Wenjun Jiang, Lu Su, Yaliang Li, Suxin Guo, Zhan Qin, Jing Gao and Kui Ren. 2019. ACM Transactions on Sensor Networks.
Crowd Sensing Truth Discovery Privacy-Preserving
"Towards confidence interval estimation in truth discovery", with Jing Gao, Qi Li, Fenglong Ma, Lu Su, Yunlong Feng and Aidong Zhang. 2018.
IEEE Transactions on Knowledge and Data Engineering.
Truth Discovery Data Mining Confidence Interval Estimation Bootstrapping
"Incentive mechanism for privacy-aware data aggregation in mobile crowd sensing systems", with Haiming Jin, Lu Su and Klara Nahrstedt. 2018.
IEEE/ACM Transactions on Networking.
Machine Learning Incentive Mechanism Data Aggregation Privacy Preservation Mobile Crowd Sensing
"Towards data poisoning attacks in crowd sensing systems", with Chenglin Miao, Qi Li, Wenjun Jiang, Mengdi Huai and Lu Su. 2018.
ACM International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc).
Crowd Sensing Data Mining Truth Discovery Data Posioning
"Inception: Incentivizing privacy-preserving data aggregation for mobile crowd sensing systems", with Haiming Jin, Lu Su and Klara Nahrstedt. 2016.
ACM International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc).
Crowd Sensing Incentive Mechanism Data Aggregation Truth Discovery Privacy
"Towards Confidence in the Truth: A Bootstrapping based Truth Discovery Approach", with Jing Gao, Qi Li, Fenglong Ma, Lu Su, Yunlong Feng and Aidong Zhang. 2016. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD).
Truth Discovery Data Veracity Optimization Bootstrapping
"A Truth Discovery Approach with Theoretical Guarantee", with Jing Gao, Zhaoran Wang, Shiyu Wang, Lu Su and Han Liu. 2016.
ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD).
Truth Discovery Mixture Model Asymptotic Consistency