CS547 Human-Computer Interaction Seminar (Seminar on People, Computers, and Design)Fridays 12:30-2:20 · Gates B01 · Open to the public
Data Mining Meets HCI: Making Sense of Large Graphs
March 15, 2013
We have entered the era of big data. Datasets surpassing terabytes now arise in science, government and enterprises. Yet, making sense of these data remains a fundamental challenge. Where do we start our analysis? Where to go next? And how to visualize our findings? My research takes a step towards answering these questions.
I work in Data Mining and Human-Computer Interaction (HCI), and I combine the best from both worlds to create tools that help people make sense of graphs with billions of nodes and edges. I present my work in three interrelated topics.
(1) Attention Routing: I introduce this idea, based on anomaly detection and machine inference, that automatically draws people's attention to interesting parts of the graph. I describe two examples: the Polonium technology unearths malware from 37 billion machine-file relationships; the NetProbe system fingers bad guys who commit auction fraud.
(2) Mixed-Initiative Graph Sensemaking: I describe the Apolo system that combines machine inference and visualization to guide the user to interactively explore large graphs. The user gives examples of relevant nodes, and Apolo recommends which areas the user may want to see next. In a user study, Apolo helped participants find significantly more relevant articles than Google Scholar.
(3) Scaling Up: I show how we may enable interactive analytics of large graphs with a hybrid architecture that harnesses parallel computation for expensive tasks, and local computation for fast machine inference, visualization, and interaction.