Skip to main content
Cornell university
Cornell Statistics and Data Science Cornell Statistics and Data Science
  • About Us

    About Us
    Cornell's Department of Statistics and Data Science offers four programs at the undergraduate and graduate levels. Undergraduates can earn a BA in statistical science, social...

    Welcome to the Department of Statistics and Data Science
    History
    Facilities
    Statistics Graduate Society
    Recently Published Papers
  • Academics

    Academics

    Undergraduate
    PhD
    MPS
    PhD Minor in Data Science
    Courses & Course Enrollment
  • People

    People

    Faculty
    Field Faculty
    PhDs
    Emeritus Faculty
    Academic Staff
    Staff
    Research Areas of Expertise
    Statistical Consultants
  • News and Events

    News and Events

    Events
    News
  • Resources

    Resources

    Professional Societies and Meetings
    Affiliated Groups
    Career Services
    Cornell Statistical Consulting Unit
  • Alumni

    Alumni
    Cornell's Statistics and Data Science degrees prepare students for a wide variety of careers, from academia to industry.  See the After Graduation page for a general overview of...

    Alumni Profiles

Search form

You are here

  1. Home 
  2. Events 
  3. Statistics Seminars

Statistics Seminar Speaker: Karl Rohe, 11/14/2018

Event Layout

Wednesday Nov 14 2018

Statistics Seminar Speaker: Karl Rohe, 11/14/2018

4:15pm @ G01 Biotechnology
In Statistics Seminars

The Statistics Seminar speaker for Wednesday, November 14, 2018, is Karl Rohe, associate professor of statistics at the University of Wisconsin-Madison. He studies the statistical performance of computationally tractable estimation techniques for network data analysis and works on https://murmuration.wisc.edu.  He received his PhD and the Evelyn Fix Memorial Medal from UC Berkeley in 2011.

Talk: Making Spectral Graph Theory work in practice. Making the practice work in theory.

Abstract:After introducing Cheeger's Inequality and spectral clustering, this talk has two parts. The first part will (1) show how spectral clustering gives "bad results" in many applied settings and (2) illustrate a "hack" that makes it work very well. People call the hack "regularized spectral clustering,” but it is not clear why it has regularization properties.  The talk will provide a simple theory that provides a deeper understanding of where the bad results come from and why the hack works so well. This theory explains the regularization by showing that spectral clustering overfits to graph conductance.

There are four pieces to this simple theory. First, sparse and stochastic graphs create a lot of small trees that are connected to the core of the graph by only one edge. Second, graph conductance is sensitive to these noisy "dangling sets." Third, by Cheeger's inequality and an inequality from Ky Fan, spectral clustering inherits this sensitivity. These three pieces explain why spectral clustering gives bad results in practice. The fourth piece uses Cheeger's inequality to show how the hack creates a new form of graph conductance that we call CoreCut. Simple inspection of CoreCut reveals why it is less sensitive to small cuts in the graph. In addition to this statistical benefit, these results also demonstrate why the regularization also improves the computational speed of spectral clustering.  This is joint work with Yilin Zhang. https://arxiv.org/abs/1806.01468.  Some ideas will be illustrated with data from murmuration.wisc.edu/ 

 

Event Categories

  • Statistics Seminars
  • Special Events

Image Gallery

Karl Rohe
  • Home
  • About Us
  • Contact Us
  • Careers
© Cornell University Department of Statistics and Data Science

1198 Comstock Hall, 129 Garden Ave., Ithaca, NY 14853

Social Menu

  • Facebook
  • Twitter
  • YouTube
Cornell Bowers CIS College of Computing and Information Science Cornell CALS ILR School

If you have a disability and are having trouble accessing information on this website or need materials in an alternate format, contact web-accessibility@cornell.edu for assistance.