KDD '15- Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Full Citation in the ACM Digital Library
SESSION: Keynote Talks
Online Controlled Experiments: Lessons from Running A/B/n Tests for 12 Years
Ron Kohavi
MOOCS: What Have We Learned?
Daphne Koller
Machine Learning and Causal Inference for Policy Evaluation
Susan Athey
Data, Knowledge and Discovery: Machine Learning meets Natural Science
Hugh Durrant-Whyte
SESSION: Research Paper Presentations (Part 1)
Large-Scale Distributed Bayesian Matrix Factorization using Stochastic Gradient MCMC
Sungjin Ahn
Anoop Korattikara
Nathan Liu
Suju Rajan
Max Welling
TimeMachine: Timeline Generation for Knowledge-Base Entities
Tim Althoff
Xin Luna Dong
Kevin Murphy
Safa Alai
Van Dang
Wei Zhang
Estimating Local Intrinsic Dimensionality
Laurent Amsaleg
Oussama Chelly
Teddy Furon
Stéphane Girard
Michael E. Houle
Ken-ichi Kawarabayashi
Michael Nett
Portraying Collective Spatial Attention in Twitter
Émilien Antoine
Adam Jatowt
Shoko Wakamiya
Yukiko Kawai
Toyokazu Akiyama
Accelerating Dynamic Time Warping Clustering with a Novel Admissible Pruning Strategy
Nurjahan Begum
Liudmila Ulanova
Jun Wang
Eamonn Keogh
Efficient Online Evaluation of Big Data Stream Classifiers
Albert Bifet
Gianmarco de Francisci Morales
Jesse Read
Geoff Holmes
Bernhard Pfahringer
Dynamically Modeling Patient's Health State from Electronic Medical Records: A Time Series Approach
Karla L. Caballero Barajas
Ram Akella
Facets: Fast Comprehensive Mining of Coevolving High-order Time Series
Yongjie Cai
Hanghang Tong
Wei Fan
Ping Ji
Qing He
Online Outlier Exploration Over Large Datasets
Lei Cao
Mingrui Wei
Di Yang
Elke A. Rundensteiner
BatchRank: A Novel Batch Mode Active Learning Framework for Hierarchical Classification
Shayok Chakraborty
Vineeth Balasubramanian
Adepu Ravi Sankar
Sethuraman Panchanathan
Jieping Ye
On the Formation of Circles in Co-authorship Networks
Tanmoy Chakraborty
Sikhar Patranabis
Pawan Goyal
Animesh Mukherjee
Heterogeneous Network Embedding via Deep Architectures
Shiyu Chang
Wei Han
Jiliang Tang
Guo-Jun Qi
Charu C. Aggarwal
Thomas S. Huang
Differentially Private High-Dimensional Data Publication via Sampling-Based Inference
Rui Chen
Qian Xiao
Yu Zhang
Jianliang Xu
Efficient Algorithms for Public-Private Social Networks
Flavio Chierichetti
Alessandro Epasto
Ravi Kumar
Silvio Lattanzi
Vahab Mirrokni
Warm Start for Parameter Selection of Linear Classifiers
Bo-Yu Chu
Chia-Hua Ho
Cheng-Hao Tsai
Chieh-Yen Lin
Chih-Jen Lin
Stream Sampling for Frequency Cap Statistics
Edith Cohen
Adaptation Algorithm and Theory Based on Generalized Discrepancy
Corinna Cortes
Mehryar Mohri
Andrés Muñoz Medina
Optimal Action Extraction for Random Forests and Boosted Trees
Zhicheng Cui
Wenlin Chen
Yujie He
Yixin Chen
Dynamic Matrix Factorization with Priors on Unknown Values
Robin Devooght
Nicolas Kourtellis
Amin Mantrach
CoupledLP: Link Prediction in Coupled Networks
Yuxiao Dong
Jing Zhang
Jie Tang
Nitesh V. Chawla
Bai Wang
Unsupervised Feature Selection with Adaptive Structure Learning
Liang Du
Yi-Dong Shen
Dirichlet-Hawkes Processes with Applications to Clustering Continuous-Time Document Streams
Nan Du
Mehrdad Farajtabar
Amr Ahmed
Alexander J. Smola
Le Song
Beyond Triangles: A Distributed Framework for Estimating 3-profiles of Large Graphs
Ethan R. Elenberg
Karthikeyan Shanmugam
Michael Borokhovich
Alexandros G. Dimakis
Hierarchical Graph-Coupled HMMs for Heterogeneous Personalized Health Data
Kai Fan
Marisa Eisenberg
Alison Walsh
Allison Aiello
Katherine Heller
More Constraints, Smaller Coresets: Constrained Matrix Approximation of Sparse Big Data
Dan Feldman
Tamir Tassa
Certifying and Removing Disparate Impact
Michael Feldman
Sorelle A. Friedler
John Moeller
Carlos Scheidegger
Suresh Venkatasubramanian
RSC: Mining and Modeling Temporal Activity in Social Media
Alceu Ferraz Costa
Yuto Yamaguchi
Agma Juci Machado Traina
Caetano Traina, Jr.
Christos Faloutsos
A Clustering-Based Framework to Control Block Sizes for Entity Resolution
Jeffrey Fisher
Peter Christen
Qing Wang
Erhard Rahm
Who Supported Obama in 2012?: Ecological Inference through Distribution Regression
Seth R. Flaxman
Yu-Xiang Wang
Alexander J. Smola
Real Estate Ranking via Mixed Land-use Latent Models
Yanjie Fu
Guannan Liu
Spiros Papadimitriou
Hui Xiong
Yong Ge
Hengshu Zhu
Chen Zhu
Adaptive Message Update for Fast Affinity Propagation
Yasuhiro Fujiwara
Makoto Nakatsuji
Hiroaki Shiokawa
Yasutoshi Ida
Machiko Toyoda
Monitoring Least Squares Models of Distributed Streams
Moshe Gabel
Daniel Keren
Assaf Schuster
Reconstructing Textual Documents from n-grams
Matthias Gallé
Matías Tealdi
Anatomical Annotations for Drosophila Gene Expression Patterns via Multi-Dimensional Visual Descriptors Integration: Multi-Dimensional Feature Learning
Hongchang Gao
Lin Yan
Weidong Cai
Heng Huang
Selective Hashing: Closing the Gap between Radius Search and k-NN Search
Jinyang Gao
H.V. Jagadish
Beng Chin Ooi
Sheng Wang
Using Local Spectral Methods to Robustify Graph-Based Learning Algorithms
David F. Gleich
Michael W. Mahoney
Instance Weighting for Patient-Specific Risk Stratification Models
Jen J. Gong
Thoralf M. Sundt
James D. Rawn
John V. Guttag
A Deep Hybrid Model for Weather Forecasting
Aditya Grover
Ashish Kapoor
Eric Horvitz
Network Lasso: Clustering and Optimization in Large Graphs
David Hallac
Jure Leskovec
Stephen Boyd
Learning Tree Structure in Multi-Task Learning
Lei Han
Yu Zhang
Probabilistic Community and Role Model for Social Networks
Yu Han
Jie Tang
Real-Time Top-R Topic Detection on Twitter with Topic Hijack Filtering
Kohei Hayashi
Takanori Maehara
Masashi Toyoda
Ken-ichi Kawarabayashi
Non-exhaustive, Overlapping Clustering via Low-Rank Semidefinite Programming
Yangyang Hou
Joyce Jiyoung Whang
David F. Gleich
Inderjit S. Dhillon
Inferring Air Quality for Station Location Recommendation Based on Urban Big Data
Hsun-Ping Hsieh
Shou-De Lin
Yu Zheng
Website Optimization Problem and Its Solutions
Shuhei Iitsuka
Yutaka Matsuo
Reciprocity in Social Networks with Capacity Constraints
Bo Jiang
Zhi-Li Zhang
Don Towsley
Learning with Similarity Functions on Graphs using Matchings of Geometric Embeddings
Fredrik D. Johansson
Devdatt Dubhashi
Structured Hedging for Resource Allocations with Leverage
Nicholas Johnson
Arindam Banerjee
Improved Bounds on the Dot Product under Random Projection and Random Sign Projection
Ata Kaban
Accelerated Alternating Direction Method of Multipliers
Mojtaba Kadkhodaie
Konstantina Christakopoulou
Maziar Sanjabi
Arindam Banerjee
Deep Computational Phenotyping
Zhengping Che
David Kale
Wenzhe Li
Mohammad Taha Bahadori
Yan Liu
Leveraging Social Context for Modeling Topic Evolution
Janani Kalyanam
Amin Mantrach
Diego Saez-Trumper
Hossein Vahabi
Gert Lanckriet
Scalable Blocking for Privacy Preserving Record Linkage
Alexandros Karakasidis
Georgia Koloniari
Vassilios S. Verykios
Real Time Recommendations from Connoisseurs
Noriaki Kawamae
Towards Decision Support and Goal Achievement: Identifying Action-Outcome Relationships From Social Media
Emre KıcKıman
Matthew Richardson
On Estimating the Swapping Rate for Categorical Data
Daniel Kifer
Simultaneous Discovery of Common and Discriminative Topics via Joint Nonnegative Matrix Factorization
Hannah Kim
Jaegul Choo
Jingu Kim
Chandan K. Reddy
Haesun Park
A Decision Tree Framework for Spatiotemporal Sequence Prediction
Taehwan Kim
Yisong Yue
Sarah Taylor
Iain Matthews
TOPTRAC: Topical Trajectory Pattern Mining
Younghoon Kim
Jiawei Han
Cangzhou Yuan
From Group to Individual Labels Using Deep Features
Dimitrios Kotzias
Misha Denil
Nando de Freitas
Padhraic Smyth
VEWS: A Wikipedia Vandal Early Warning System
Srijan Kumar
Francesca Spezzano
V.S. Subrahmanian
Unified and Contrasting Cuts in Multiple Graphs: Application to Medical Imaging Segmentation
Chia-Tung Kuo
Xiang Wang
Peter Walker
Owen Carmichael
Jieping Ye
Ian Davidson
Reducing the Unlabeled Sample Complexity of Semi-Supervised Multi-View Learning
Chao Lan
Jun Huan
Maximum Likelihood Postprocessing for Differential Privacy under Consistency Constraints
Jaewoo Lee
Yue Wang
Daniel Kifer
Online Influence Maximization
Siyu Lei
Silviu Maniu
Luyi Mo
Reynold Cheng
Pierre Senellart
The Child is Father of the Man: Foresee the Success at the Early Stage
Liangyue Li
Hanghang Tong
0-Bit Consistent Weighted Sampling
Ping Li
On the Discovery of Evolving Truth
Yaliang Li
Qi Li
Jing Gao
Lu Su
Bo Zhao
Wei Fan
Jiawei Han
MASCOT: Memory-efficient and Accurate Sampling for Counting Local Triangles in Graph Streams
Yongsub Lim
U Kang
A Learning-based Framework to Handle Multi-round Multi-party Influence Maximization on Social Networks
Su-Chen Lin
Shou-De Lin
Ming-Syan Chen
Temporal Phenotyping from Longitudinal Electronic Health Records: A Graph Based Framework
Chuanren Liu
Fei Wang
Jianying Hu
Hui Xiong
Spectral Ensemble Clustering
Hongfu Liu
Tongliang Liu
Junjie Wu
Dacheng Tao
Yun Fu
Fast and Memory-Efficient Significant Pattern Mining via Permutation Testing
Felipe Llinares-López
Mahito Sugiyama
Laetitia Papaxanthos
Karsten Borgwardt
Influence at Scale: Distributed Computation of Complex Contagion in Networks
Brendan Lucier
Joel Oren
Yaron Singer
FaitCrowd: Fine Grained Truth Discovery for Crowdsourced Data Aggregation
Fenglong Ma
Yaliang Li
Qi Li
Minghui Qiu
Jing Gao
Shi Zhi
Lu Su
Bo Zhao
Heng Ji
Jiawei Han
Algorithmic Cartography: Placing Points of Interest and Ads on Maps
Mohammad Mahdian
Okke Schrijvers
Sergei Vassilvitskii
Dimensionality Reduction Via Graph Structure Learning
Qi Mao
Li Wang
Steve Goodison
Yijun Sun
Robust Treecode Approximation for Kernel Machines
William B. March
Bo Xiao
Sameer Tharakan
Chenhan D. Yu
George Biros
Inferring Networks of Substitutable and Complementary Products
Julian McAuley
Rahul Pandey
Jure Leskovec
SESSION: Research Paper Presentations (Part 2)
Data-Driven Activity Prediction: Algorithms, Evaluation Methodology, and Applications
Bryan Minor
Janardhan Rao Doppa
Diane J. Cook
Scalable Large Near-Clique Detection in Large-Scale Networks via Sampling
Michael Mitzenmacher
Jakub Pachocki
Richard Peng
Charalampos Tsourakakis
Shen Chen Xu
Graph Query Reformulation with Diversity
Davide Mottin
Francesco Bonchi
Francesco Gullo
Flexible and Robust Multi-Network Clustering
Jingchao Ni
Hanghang Tong
Wei Fan
Xiang Zhang
Extreme States Distribution Decomposition Method for Search Engine Online Evaluation
Kirill Nikolaev
Alexey Drutsa
Ekaterina Gladkikh
Alexander Ulianov
Gleb Gusev
Pavel Serdyukov
Simultaneous Modeling of Multiple Diseases for Mortality Prediction in Acute Hospital Care
Nozomi Nori
Hisashi Kashima
Kazuto Yamashita
Hiroshi Ikai
Yuichi Imanaka
Fast and Robust Parallel SGD Matrix Factorization
Jinoh Oh
Wook-Shin Han
Hwanjo Yu
Xiaoqian Jiang
Efficient PageRank Tracking in Evolving Networks
Naoto Ohsaka
Takanori Maehara
Ken-ichi Kawarabayashi
Quick Sensitivity Analysis for Incremental Data Modification and Its Application to Leave-one-out CV in Linear Classification Problems
Shota Okumura
Yoshiki Suzuki
Ichiro Takeuchi
Non-transitive Hashing with Latent Similarity Components
Mingdong Ou
Peng Cui
Fei Wang
Jun Wang
Wenwu Zhu
Optimal Kernel Group Transformation for Exploratory Regression Analysis and Graphics
Pan Chao
Qiming Huang
Michael Zhu
Discovering and Exploiting Deterministic Label Relationships in Multi-Label Learning
Christina Papagiannopoulou
Grigorios Tsoumakas
Ioannis Tsamardinos
Subspace Clustering Using Log-determinant Rank Approximation
Chong Peng
Zhao Kang
Huiqing Li
Qiang Cheng
A PCA-Based Change Detection Framework for Multidimensional Data Streams: Change Detection in Multidimensional Data Streams
Abdulhakim A. Qahtan
Basma Alharbi
Suojin Wang
Xiangliang Zhang
State-Driven Dynamic Sensor Selection and Prediction with State-Stacked Sparseness
Guo-Jun Qi
Charu Aggarwal
Deepak Turaga
Daby Sow
Phil Anno
SCRAM: A Sharing Considered Route Assignment Mechanism for Fair Taxi Route Recommendations
Shiyou Qian
Jian Cao
Frédéric Le Mouël
Issam Sahel
Minglu Li
Locally Densest Subgraph Discovery
Lu Qin
Rong-Hua Li
Lijun Chang
Chengqi Zhang
Virus Propagation in Multiple Profile Networks
Angeliki Rapti
Spyros Sioutas
Kostas Tsichlas
Giannis Tzimas
Collective Opinion Spam Detection: Bridging Review Networks and Metadata
Shebuti Rayana
Leman Akoglu
ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering
Xiang Ren
Ahmed El-Kishky
Chi Wang
Fangbo Tao
Clare R. Voss
Jiawei Han
Mining Frequent Itemsets through Progressive Sampling with Rademacher Averages
Matteo Riondato
Eli Upfal
Why It Happened: Identifying and Modeling the Reasons of the Happening of Social Events
Yu Rong
Hong Cheng
Zhiyu Mo
Matrix Completion with Queries
Natali Ruchansky
Mark Crovella
Evimaria Terzi
Stochastic Divergence Minimization for Online Collapsed Variational Bayes Zero Inference of Latent Dirichlet Allocation
Issei Sato
Hiroshi Nakagawa
Bayesian Poisson Tensor Factorization for Inferring Multilateral Relations from Sparse Dyadic Event Counts
Aaron Schein
John Paisley
David M. Blei
Hanna Wallach
TimeCrunch: Interpretable Dynamic Graph Summarization
Neil Shah
Danai Koutra
Tianmin Zou
Brian Gallagher
Christos Faloutsos
Inside Jokes: Identifying Humorous Cartoon Captions
Dafna Shahaf
Eric Horvitz
Robert Mankoff
Community Detection based on Distance Dynamics
Junming Shao
Zhichao Han
Qinli Yang
Tao Zhou
Discovery of Meaningful Rules in Time Series
Mohammad Shokoohi-Yekta
Yanping Chen
Bilson Campana
Bing Hu
Jesin Zakaria
Eamonn Keogh
An Evaluation of Parallel Eccentricity Estimation Algorithms on Undirected Real-World Graphs
Julian Shun
Efficient Latent Link Recommendation in Signed Networks
Dongjin Song
David A. Meyer
Dacheng Tao
Turn Waste into Wealth: On Simultaneous Clustering and Cleaning over Dirty Data
Shaoxu Song
Chunping Li
Xiaoquan Zhang
Set Cover at Web Scale
Stergios Stergiou
Kostas Tsioutsiouliklis
Exploiting Relevance Feedback in Knowledge Graph Search
Yu Su
Shengqi Yang
Huan Sun
Mudhakar Srivatsa
Sue Kase
Michelle Vanni
Xifeng Yan
LINKAGE: An Approach for Comprehensive Risk Prediction for Care Management
Zhaonan Sun
Fei Wang
Jianying Hu
Transitive Transfer Learning
Ben Tan
Yangqiu Song
Erheng Zhong
Qiang Yang
PTE: Predictive Text Embedding through Large-scale Heterogeneous Text Networks
Jian Tang
Meng Qu
Qiaozhu Mei
An Effective Marketing Strategy for Revenue Maximization with a Quantity Constraint
Ya-Wen Teng
Chih-Hua Tai
Philip S. Yu
Ming-Syan Chen
Scaling Up Stochastic Dual Coordinate Ascent
Kenneth Tran
Saghar Hosseini
Lin Xiao
Thomas Finley
Mikhail Bilenko
Discovering Valuable items from Massive Data
Hastagiri P. Vanchinathan
Andreas Marfurt
Charles-Antoine Robelin
Donald Kossmann
Andreas Krause
Deep Learning Architecture with Dynamically Programmed Layers for Brain Connectome Prediction
Vivek Veeriah
Rohit Durvasula
Guo-Jun Qi
Incorporating World Knowledge to Document Clustering via Heterogeneous Information Networks
Chenguang Wang
Yangqiu Song
Ahmed El-Kishky
Dan Roth
Ming Zhang
Jiawei Han
Towards Interactive Construction of Topical Hierarchy: A Recursive Tensor Decomposition Approach
Chi Wang
Xueqing Liu
Yanglei Song
Jiawei Han
Collaborative Deep Learning for Recommender Systems
Hao Wang
Naiyan Wang
Dit-Yan Yeung
Trading Interpretability for Accuracy: Oblique Treed Sparse Additive Models
Jialei Wang
Ryohei Fujimaki
Yosuke Motohashi
Geo-SAGE: A Geographical Sparse Additive Generative Model for Spatial Item Recommendation
Weiqing Wang
Hongzhi Yin
Ling Chen
Yizhou Sun
Shazia Sadiq
Xiaofang Zhou
Rubik: Knowledge Guided Tensor Factorization and Completion for Health Data Analytics
Yichen Wang
Robert Chen
Joydeep Ghosh
Joshua C. Denny
Abel Kho
You Chen
Bradley A. Malin
Jimeng Sun
Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data
Yingzi Wang
Nicholas Jing Yuan
Defu Lian
Linli Xu
Xing Xie
Enhong Chen
Yong Rui
Dynamic Poisson Autoregression for Influenza-Like-Illness Case Count Prediction
Zheng Wang
Prithwish Chakraborty
Sumiko R. Mekaru
John S. Brownstein
Jieping Ye
Naren Ramakrishnan
Cinema Data Mining: The Smell of Fear
Jörg Wicker
Nicolas Krauter
Bettina Derstorff
Christof Stönner
Efstratios Bourtsoukidis
Thomas Klüpfel
Jonathan Williams
Stefan Kramer
Predicting Winning Price in Real Time Bidding with Censored Data
Wush Chi-Hsuan Wu
Mi-Yen Yeh
Ming-Syan Chen
Diversifying Restricted Boltzmann Machine for Document Modeling
Pengtao Xie
Yuntian Deng
Eric Xing
Edge-Weighted Personalized PageRank: Breaking A Decade-Old Performance Barrier
Wenlei Xie
David Bindel
Alan Demers
Johannes Gehrke
Petuum: A New Platform for Distributed Machine Learning on Big Data
Eric P. Xing
Qirong Ho
Wei Dai
Jin-Kyu Kim
Jinliang Wei
Seunghak Lee
Xun Zheng
Pengtao Xie
Abhimanu Kumar
Yaoliang Yu
Longitudinal LASSO: Jointly Learning Features and Temporal Contingency for Outcome Prediction
Tingyang Xu
Jiangwen Sun
Jinbo Bi
Performance Modeling and Scalability Optimization of Distributed Deep Learning Systems
Feng Yan
Olatunji Ruwase
Yuxiong He
Trishul Chilimbi
Deep Graph Kernels
Pinar Yanardag
S.V.N. Vishwanathan
Model Multiple Heterogeneity via Hierarchical Multi-Latent Space Learning
Pei Yang
Jingrui He
Structural Graphical Lasso for Learning Mouse Brain Connectivity
Sen Yang
Qian Sun
Shuiwang Ji
Peter Wonka
Ian Davidson
Jieping Ye
Entity Matching across Heterogeneous Sources
Yang Yang
Yizhou Sun
Jie Tang
Bo Ma
Juanzi Li
An Efficient Semi-Supervised Clustering Algorithm with Sequential Constraints
Jinfeng Yi
Lijun Zhang
Tianbao Yang
Wei Liu
Jun Wang
Assembler: Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data
Chao Zhang
Yu Zheng
Xiuli Ma
Jiawei Han
Dynamic Topic Modeling for Monitoring Market Competition from Online Text and Image Data
Hao Zhang
Gunhee Kim
Eric P. Xing
Organizational Chart Inference
Jiawei Zhang
Philip S. Yu
Yuanhua Lv
Panther: Fast Top-k Similarity Search on Large Networks
Jing Zhang
Jie Tang
Cong Ma
Hanghang Tong
Yu Jing
Juanzi Li
A Collective Bayesian Poisson Factorization Model for Cold-start Local Event Recommendation
Wei Zhang
Jianyong Wang
Statistical Arbitrage Mining for Display Advertising
Weinan Zhang
Jun Wang
Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis
Wenlu Zhang
Rongjian Li
Tao Zeng
Qian Sun
Sudhir Kumar
Jieping Ye
Shuiwang Ji
COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency
Yutao Zhang
Jie Tang
Zhilin Yang
Jian Pei
Philip S. Yu
SAME but Different: Fast and High Quality Gibbs Parameter Estimation
Huasha Zhao
Biye Jiang
John F. Canny
Bobby Jaros
Multi-Task Learning for Spatio-Temporal Event Forecasting
Liang Zhao
Qian Sun
Jieping Ye
Feng Chen
Chang-Tien Lu
Naren Ramakrishnan
SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity
Qingyuan Zhao
Murat A. Erdogdu
Hera Y. He
Anand Rajaraman
Jure Leskovec
Linear Time Samplers for Supervised Topic Models using Compositional Proposals
Xun Zheng
Yaoliang Yu
Eric P. Xing
L∞ Error and Bandwidth Selection for Kernel Density Estimates of Large Data
Yan Zheng
Jeff M. Phillips
Modeling Truth Existence in Truth Discovery
Shi Zhi
Bo Zhao
Wenzhu Tong
Jing Gao
Dian Yu
Heng Ji
Jiawei Han
Cuckoo Linear Algebra
Li Zhou
David G. Andersen
Mu Li
Alexander J. Smola
Integrating Vertex-centric Clustering with Edge-centric Clustering for Meta Path Graph Analysis
Yang Zhou
Ling Liu
David Buttler
Modeling User Mobility for Location Promotion in Location-based Social Networks
Wen-Yuan Zhu
Wen-Chih Peng
Ling-Jyh Chen
Kai Zheng
Xiaofang Zhou
Co-Clustering based Dual Prediction for Cargo Pricing Optimization
Yada Zhu
Hongxia Yang
Jingrui He
Debiasing Crowdsourced Batches
Honglei Zhuang
Aditya Parameswaran
Dan Roth
Jiawei Han
Query Workloads for Data Series Indexes
Kostas Zoumpatianos
Yin Lou
Themis Palpanas
Johannes Gehrke
SESSION: Industry & Government Track Invited Talks
Scaling Machine Learning and Statistics for Web Applications
Deepak Agarwal
Hadoop's Impact on the Future of Data Management
Amr Awadallah
Should You Trust Your Money to a Robot?
Vasant Dhar
Data Science at Visa
Waqar Hasan
Min Wang
How Artificial Intelligence and Big Data Created Rocket Fuel: A Case Study
George John
Optimizing Marketing Impact through Data Driven Decisioning
Anil Kamath
Powering Real-time Decision Engines in Finance and Healthcare using Open Source Software
Bassel Ojjeh
Clouded Intelligence
Joseph Sirosh
Data Science from the Lab to the Field to the Enterprise
Christopher White
User Modeling in Telecommunications and Internet Industry
Qiang Yang
SESSION: Industry & Government Track Papers
The Effectiveness of Marketing Strategies in Social Media: Evidence from Promotional Events
Panagiotis Adamopoulos
Vilma Todri
Personalizing LinkedIn Feed
Deepak Agarwal
Bee-Chung Chen
Qi He
Zhenhao Hua
Guy Lebanon
Yiming Ma
Pannagadatta Shivaswamy
Hsiao-Ping Tseng
Jaewon Yang
Liang Zhang
Whither Social Networks for Web Search?
Rakesh Agrawal
Behzad Golshan
Evangelos Papalexakis
Exploiting Data Mining for Authenticity Assessment and Protection of High-Quality Italian Wines from Piedmont
Marco Arlorio
Jean Daniel Coisson
Giorgio Leonardi
Monica Locatelli
Luigi Portinale
Predictive Approaches for Low-Cost Preventive Medicine Program in Developing Countries
Yukino Baba
Hisashi Kashima
Yasunobu Nohara
Eiko Kai
Partha Ghosh
Rafiqul Islam
Ashir Ahmed
Masahiro Kuroda
Sozo Inoue
Tatsuo Hiramatsu
Michio Kimura
Shuji Shimizu
Kunihisa Kobayashi
Koji Tsuda
Masashi Sugiyama
Mathieu Blondel
Naonori Ueda
Masaru Kitsuregawa
Naoki Nakashima
Dynamic Hierarchical Classification for Patient Risk-of-Readmission
Senjuti Basu Roy
Ankur Teredesai
Kiyana Zolfaghar
Rui Liu
David Hazel
Stacey Newman
Albert Marinez
ALOJA-ML: A Framework for Automating Characterization and Knowledge Discovery in Hadoop Deployments
Josep Lluís Berral
Nicolas Poggi
David Carrera
Aaron Call
Rob Reinauer
Daron Green
Multi-View Incident Ticket Clustering for Optimal Ticket Dispatching
Mirela Madalina Botezatu
Jasmina Bogojeska
Ioana Giurgiu
Hagen Voelzer
Dorothea Wiesmann
Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission
Rich Caruana
Yin Lou
Johannes Gehrke
Paul Koch
Marc Sturm
Noemie Elhadad
User Conditional Hashtag Prediction for Images
Emily Denton
Jason Weston
Manohar Paluri
Lubomir Bourdev
Rob Fergus
Big Data System for Analyzing Risky Procurement Entities
Amit Dhurandhar
Bruce Graves
Rajesh Ravi
Gopikrishanan Maniachari
Markus Ettl
Probabilistic Modeling of a Sales Funnel to Prioritize Leads
Brendan Andrew Duncan
Charles Peter Elkan
Online Topic-based Social Influence Analysis for the Wimbledon Championships
Varun R. Embar
Indrajit Bhattacharya
Vinayaka Pandit
Roman Vaculin
Collective Spammer Detection in Evolving Multi-Relational Social Networks
Shobeir Fakhraei
James Foulds
Madhusudana Shashanka
Lise Getoor
Utilizing Text Mining on Online Medical Forums to Predict Label Change due to Adverse Drug Reactions
Ronen Feldman
Oded Netzer
Aviv Peretz
Binyamin Rosenfeld
One-Pass Ranking Models for Low-Latency Product Recommendations
Antonino Freno
Martin Saveski
Rodolphe Jenatton
Cédric Archambeau
On the Reliability of Profile Matching Across Large Online Social Networks
Oana Goga
Patrick Loiseau
Robin Sommer
Renata Teixeira
Krishna P. Gummadi
E-commerce in Your Inbox: Product Recommendations at Scale
Mihajlo Grbovic
Vladan Radosavljevic
Nemanja Djuric
Narayan Bhamidipati
Jaikit Savla
Varun Bhagwan
Doug Sharp
Gender and Interest Targeting for Sponsored Post Advertising at Tumblr
Mihajlo Grbovic
Vladan Radosavljevic
Nemanja Djuric
Narayan Bhamidipati
Ananth Nagarajan
Mining Administrative Data to Spur Urban Revitalization
Ben Green
Alejandra Caro
Matthew Conway
Robert Manduca
Tom Plagge
Abby Miller
Measuring Causal Impact of Online Actions via Natural Experiments: Application to Display Advertising
Daniel N. Hill
Robert Moakler
Alan E. Hubbard
Vadim Tsemekhman
Foster Provost
Kiril Tsemekhman
Focusing on the Long-term: It's Good for Users and Business
Henning Hohnhold
Deirdre O'Brien
Diane Tang
Traffic Measurement and Route Recommendation System for Mass Rapid Transit (MRT)
Thomas Holleczek
Dang The Anh
Shanyang Yin
Yunye Jin
Spiros Antonatos
Han Leong Goh
Samantha Low
Amy Shi-Nash
Real-Time Bid Prediction using Thompson Sampling-Based Expert Selection
Elena Ikonomovska
Sina Jafarpour
Ali Dasdan
Life-stage Prediction for Product Recommendation in E-commerce
Peng Jiang
Yadong Zhu
Yi Zhang
Quan Yuan
Visual Search at Pinterest
Yushi Jing
David Liu
Dmitry Kislyuk
Andrew Zhai
Jiajing Xu
Jeff Donahue
Sarah Tavel
Discovering Collective Narratives of Theme Parks from Large Collections of Visitors' Photo Streams
Gunhee Kim
Leonid Sigal
A Machine Learning Framework to Identify Students at Risk of Adverse Academic Outcomes
Himabindu Lakkaraju
Everaldo Aguiar
Carl Shan
David Miller
Nasir Bhanpuri
Rayid Ghani
Kecia L. Addison
Probabilistic Graphical Models of Dyslexia
Yair Lakretz
Gal Chechik
Naama Friedmann
Michal Rosen-Zvi
Promoting Positive Post-Click Experience for In-Stream Yahoo Gemini Users
Mounia Lalmas
Janette Lehmann
Guy Shaked
Fabrizio Silvestri
Gabriele Tolomei
Generic and Scalable Framework for Automated Time-series Anomaly Detection
Nikolay Laptev
Saeed Amizadeh
Ian Flint
Leveraging Knowledge Bases for Contextual Entity Exploration
Joonseok Lee
Ariel Fuxman
Bo Zhao
Yuanhua Lv
Click-through Prediction for Advertising in Twitter Timeline
Cheng Li
Yue Lu
Qiaozhu Mei
Dong Wang
Sandeep Pandey
Predicting Voice Elicited Emotions
Ying Li
Jose D. Contreras
Luis J. Salazar
Discovery of Glaucoma Progressive Patterns Using Hierarchical MDL-Based Clustering
Shigeru Maya
Kai Morino
Hiroshi Murata
Ryo Asaoka
Kenji Yamanishi
Distributed Personalization
Xu Miao
Chun-Te Chu
Lijun Tang
Yitong Zhou
Joel Young
Anmol Bhasin
Voltage Correlations in Smart Meter Data
Rajendu Mitra
Ramachandra Kota
Sambaran Bandyopadhyay
Vijay Arya
Brian Sullivan
Richard Mueller
Heather Storey
Gerard Labut
Analyzing Invariants in Cyber-Physical Systems using Latent Factor Regression
Marjan Momtazpour
Jinghe Zhang
Saifur Rahman
Ratnesh Sharma
Naren Ramakrishnan
Predicting Future Scientific Discoveries Based on a Networked Analysis of the Past Literature
Meenakshi Nagarajan
Angela D. Wilkins
Benjamin J. Bachman
Ilya B. Novikov
Shenghua Bao
Peter J. Haas
María E. Terrón-Díaz
Sumit Bhatia
Anbu K. Adikesavan
Jacques J. Labrie
Sam Regenbogen
Christie M. Buchovecky
Curtis R. Pickering
Linda Kato
Andreas M. Lisewski
Ana Lelescu
Houyin Zhang
Stephen Boyer
Griff Weber
Ying Chen
Lawrence Donehower
Scott Spangler
Olivier Lichtarge
Learning a Hierarchical Monitoring System for Detecting and Diagnosing Service Issues
Vinod Nair
Ameya Raul
Shwetabh Khanduja
Vikas Bahirwani
Sundararajan Sellamanickam
Sathiya Keerthi
Steve Herbert
Sudheer Dhulipalla
Predictive Modeling for Public Health: Preventing Childhood Lead Poisoning
Eric Potash
Joe Brew
Alexander Loewi
Subhabrata Majumdar
Andrew Reece
Joe Walsh
Eric Rozier
Emile Jorgenson
Raed Mansour
Rayid Ghani
Proof Protocol for a Machine Learning Technique Making Longitudinal Predictions in Dynamic Contexts
Kevin B. Pratt
An Architecture for Agile Machine Learning in Real-Time Applications
Johann Schleier-Smith
Scalable Machine Learning Approaches for Neighborhood Classification Using Very High Resolution Remote Sensing Imagery
Manu Sethi
Yupeng Yan
Anand Rangarajan
Ranga Raju Vatsavai
Sanjay Ranka
Early Identification of Violent Criminal Gang Members
Elham Shaabani
Ashkan Aleali
Paulo Shakarian
John Bertetto
Spoken English Grading: Machine Learning with Crowd Intelligence
Vinay Shashidhar
Nishant Pandey
Varun Aggarwal
Effective Audience Extension in Online Advertising
Jianqiang Shen
Sahin Cem Geyik
Ali Dasdan
Going In-Depth: Finding Longform on the Web
Virginia Smith
Miriam Connor
Isabelle Stanton
Early Prediction of Cardiac Arrest (Code Blue) using Electronic Medical Records
Sriram Somanchi
Samrachana Adhikari
Allen Lin
Elena Eneva
Rayid Ghani
When-To-Post on Social Networks
Nemanja Spasojevic
Zhisheng Li
Adithya Rao
Prantik Bhattacharyya
Mining for Causal Relationships: A Data-Driven Study of the Islamic State
Andrew Stanton
Amanda Thart
Ashish Jain
Priyank Vyas
Arpan Chatterjee
Paulo Shakarian
Transfer Learning for Bilingual Content Classification
Qian Sun
Mohammad Amin
Baoshi Yan
Craig Martell
Vita Markman
Anmol Bhasin
Jieping Ye
FrauDetector: A Graph-Mining-based Framework for Fraudulent Phone Call Detection
Vincent S. Tseng
Jia-Ching Ying
Che-Wei Huang
Yimin Kao
Kuan-Ta Chen
Efficient Long-Term Degradation Profiling in Time Series for Complex Physical Systems
Liudmila Ulanova
Tan Yan
Haifeng Chen
Guofei Jiang
Eamonn Keogh
Kai Zhang
Interpreting Advertiser Intent in Sponsored Search
Bhanu C. Vattikonda
Santhosh Kodipaka
Hongyan Zhou
Vacha Dave
Saikat Guha
Alex C. Snoeren
Client Clustering for Hiring Modeling in Work Marketplaces
Vasilis Verroios
Panagiotis Papadimitriou
Ramesh Johari
Hector Garcia-Molina
Discerning Tactical Patterns for Professional Soccer Teams: An Enhanced Topic Model with Applications
Qing Wang
Hengshu Zhu
Wei Hu
Zhiyong Shen
Yuan Yao
Predicting Serves in Tennis using Style Priors
Xinyu Wei
Patrick Lucey
Stuart Morgan
Peter Carr
Machar Reid
Sridha Sridharan
Smart Pacing for Effective Online Ad Campaign Optimization
Jian Xu
Kuang-chih Lee
Wentong Li
Hang Qi
Quan Lu
From Infrastructure to Culture: A/B Testing Challenges in Large Scale Social Networks
Ya Xu
Nanyu Chen
Addrian Fernandez
Omar Sinno
Anmol Bhasin
Tornado Forecasting with Multiple Markov Boundaries
Kui Yu
Dawei Wang
Wei Ding
Jian Pei
David L. Small
Shafiqul Islam
Xindong Wu
Gas Concentration Reconstruction for Coal-Fired Boilers Using Gaussian Process
Chao Yuan
Matthias Behmann
Bernhard Meerbeck
Annotating Needles in the Haystack without Looking: Product Information Extraction from Emails
Weinan Zhang
Amr Ahmed
Jie Yang
Vanja Josifovski
Alex J. Smola
Forecasting Fine-Grained Air Quality Based on Big Data
Yu Zheng
Xiuwen Yi
Ming Li
Ruiyuan Li
Zhangqing Shan
Eric Chang
Tianrui Li
Building Discriminative User Profiles for Large-scale Content Recommendation
Erheng Zhong
Nathan Liu
Yue Shi
Suju Rajan
Stock Constrained Recommendation in Tmall
Wenliang Zhong
Rong Jin
Cheng Yang
Xiaowei Yan
Qi Zhang
Qiang Li
Predicting Ambulance Demand: a Spatio-Temporal Kernel Approach
Zhengyi Zhou
David S. Matteson
TUTORIAL SESSION: Tutorials
Web Personalization and Recommender Systems
Shlomo Berkovsky
Jill Freyne
Graph-Based User Behavior Modeling: From Prediction to Fraud Detection
Alex Beutel
Leman Akoglu
Christos Faloutsos
Data-Driven Product Innovation
Xin Fu
Hernán Asorey
Dense Subgraph Discovery: KDD 2015 tutorial
Aristides Gionis
Charalampos E. Tsourakakis
Diffusion in Social and Information Networks: Research Problems, Probabilistic Models and Machine Learning Methods
Manuel Gomez Rodriguez
Le Song
Social Media Anomaly Detection: Challenges and Solutions
Yan Liu
Sanjay Chawla
Automatic Entity Recognition and Typing from Massive Text Corpora: A Phrase and Network Mining Approach
Xiang Ren
Ahmed El-Kishky
Chi Wang
Jiawei Han
VC-Dimension and Rademacher Averages: From Statistical Learning Theory to Sampling Algorithms
Matteo Riondato
Eli Upfal
Large Scale Distributed Data Science using Apache Spark
James G. Shanahan
Laing Dai
Medical Mining: KDD 2015 Tutorial
Myra Spiliopoulou
Pedro Pereira Rodrigues
Ernestina Menasalvas
Big Data Analytics: Optimization and Randomization
Tianbao Yang
Qihang Lin
Rong Jin
PANEL SESSION: Panel
Data Driven Science: SIGKDD Panel
Katharina Morik
Hugh Durrant-Whyte
Gary Hill
Dietmar Müller
Tanya Berger-Wolf