MSR 2010: 7th IEEE Working Conference on Mining Software Repositories


May 2nd-3rd, 2010
Cape Town, South Africa

Co-located with ICSE 2010,
International Conference on Software Engineering


Contact the MSR organizers at

General Chair

Audris Mockus
Avaya, USA

Program Co-chairs

Jim Whitehead
University of California, Santa Cruz

Thomas Zimmermann
Microsoft Research, USA

Challenge Chair

Abram Hindle
University of Waterloo, Canada

Program Committee

Giuliano Antoniol
(École Polytechnique de Montréal, Canada)
Andrew Begel
(Microsoft Corp., USA)
Christian Bird
(UC Davis, USA)
Li-Te Cheng
(IBM Research, USA)
Stephan Diehl
(U. of Trier, Germany)
Massimiliano Di Penta
(U. of Sannio, Italy)
Harald Gall
(U. of Zurich, Switzerland)
Tudor Girba
(U. of Bern, Switzerland)
Mike Godfrey
(U. of Waterloo, Canada)
Jesus Gonzalez-Barahona
(U. Rey Juan Carlos, Spain)
Ahmed Hassan
(Queen's U., Canada)
Reid Holmes
(U. of Washington, USA)
Katsuro Inoue
(U. of Osaka, Japan)
Huzefa Kagdi
(Missouri U. of Science and Technology, USA)
Miryung Kim
(U. of Texas (Austin), USA)
Sung Kim
(Hong Kong U. of Science and Technology, China)
Michele Lanza
(U. of Lugano, Switzerland)
Andrian Marcus
(Wayne State U., USA)
Ken-ichi Matsumoto
(Nara Institute of Science and Technology, Japan)
Tim Menzies
(West Virginia U., USA)
Nachiappan Nagappan
(Microsoft Corp., USA)
Martin Pinzger
(Delft Technical U., Netherlands)
Rahul Premraj
(Vrije U. Amsterdam, The Netherlands)
Lori Pollock
(U. of Delaware, USA)
Bill Pugh
(U. of Maryland, USA)
Martin Robillard
(McGill U., Canada)
Gregorio Robles
(U. Rey Juan Carlos, Spain)
Anita Sarma
(U. of Nebraska, USA)
Tao Xie
(North Caroline State U., USA)
Westley Weimer
(U. of Virginia, USA)
Laurie Williams
(North Carolina State U., USA)
Andy Zaidman
(Delft Technical U., Netherlands)
Andreas Zeller
(Saarland U., Germany)

Web Chair

Adrian Schröter
University of Victoria, Canada


ICSE 2010
Co-located with ICSE 2010,
Cape Town, South Africa

Steering Committee

Ahmed E. Hassan
Queen's University, Canada
Audris Mockus
Avaya, USA
Ric Holt
University of Waterloo, Canada
Katsuro Inoue
Osaka University, Japan
Stephan Diehl
University Trier, Germany
Harald Gall
University of Zurich, Switzerland
Michele Lanza
University of Lugano, Switzerland Michael W. Godfrey
University of Waterloo, Canada

Earlier MSRs

MSR 2009 – Vancouver
MSR 2008 – Leipzig
MSR 2007 – Minneapolis
MSR 2006 – Shanghai
MSR 2005 – Saint Louis
MSR 2004 – Edinburgh

Latest news

Slides from Jim Herbsleb's keynote Mining for Scientific Results? available as pdf here.

Slides from Michele Lanza's keynote The Visual Terminator are now available on

Program is now online and Proceedings are now online.

Register now for MSR:

The MSR 2010 Prediction Challenge is extended 2 days! Submit your predictions by February 22 and you could win a Zune HD!

Acceptance notifications have been sent: 16 full papers and 5 short papers (acceptance rate 31.3%).

Follow MSR 2010 on Twitter and Facebook.

MSR 2010 Highlights

The MSR 2010 program includes:


Software repositories such as source control systems, archived communications between project personnel, and defect tracking systems are used to help manage the progress of software projects. Software practitioners and researchers are recognizing the benefits of mining this information to support the maintenance of software systems, improve software design/reuse, and empirically validate novel ideas and techniques. Research is now proceeding to uncover the ways in which mining these repositories can help to understand software development and software evolution, to support predictions about software development, and to exploit this knowledge concretely in planning future development.

The goal of this two-day working conference is to advance the science and practice of software engineering via the analysis of data stored in software repositories.

We solicit short papers (4 pages) and research papers (10 pages). Short papers should discuss controversial issues in the field, or describe interesting or thought provoking ideas that are not yet fully developed. Accepted short papers will present their ideas in poster form during a poster session at the conference, and in a short lightning talk. Full research papers are expected to describe new research results, and have a higher degree of technical rigor than short papers. Accepted full papers will present their ideas in a research talk at the conference. Paper submissions must be formatted in the IEEE CS proceedings style (Two Column Format) A selection of the best research papers will be invited for consideration in a special issue of the Springer journal Empirical Software Engineering.


Papers may address issues along the general themes, including but not limited to the following:

  • Analysis of software ecosystems and mining of repositories across multiple projects
  • Models for social and development processes that occur in large software projects
  • Prediction of future software qualities via analysis of software repositories
  • Models of software project evolution based on historical repository data
  • Characterization, classification, and prediction of software defects based on analysis of software repositories
  • Techniques to model reliability and defect occurrences
  • Search-based software engineering, including search techniques to assist developers in finding suitable components and code fragments for reuse, and software search engines
  • Analysis of change patterns and trends to assist in future development
  • Visualization techniques and models of mined data
  • Techniques and tools for capturing new forms of data for storage in software repositories, such as effort data, fine-grained changes, and refactoring
  • Approaches, applications, and tools for software repository mining
  • Characterization of bias in mining and guidelines to ensure quality results
  • Meta-models, exchange formats, and infrastructure tools to facilitate the sharing of extracted data and to encourage reuse and repeatability
  • Case studies on extracting data from repositories of large long-lived and/or industrial projects
  • Methods of integrating mined data from various historical sources

MSR Challenge

MSR Challenge. We invite researchers to demonstrate the usefulness of their mining tools on the source code repositories, bug data, and mailing list archives of the FreeBSD distribution, Ultimate Debian Database, and the GNOME desktop suite by participating in the two MSR Challenge tracks:

  • General. Discover interesting facts about the history of FreeBSD? distribution, Ultimate Debian Database, and the GNOME desktop suite. Results should be reported as 4-page submissions, to be included in the proceedings as challenge papers.
  • Prediction. We challenge you to predict the newest bug number for the Debian project on April 30th, 2010. You can provide 1-page long descriptions of the rationale behind your prediction. Wild guesses are also welcome and will put "real" miners under pressure.
The winners of both tracks will receive an award.
Click here for a more detailed description of the challenge.

Important Dates

Submission (research/short papers): January 14, 2010 extended until end-of-day Sunday, January 17, 2010
Submission (challenge paper): February 6, 2010
Submission (challenge predictions): February 20, 2010
Author notification (all tracks): February 20, 2010
Camera-ready copy: March 12, 2010
Conference dates: May 2nd-3rd, 2010

Note: All deadlines are 11:59 PM (Apia, Samoa Time) on the dates indicated.

Note: Submissions may still be made to the conference even if an abstract has not previously been submitted.


Click here to submit MSR 2010 abstracts and papers.
Click here to submit MSR 2010 challenge reports.
Papers must be in the IEEE CS proceedings style - Two Column Format.

The proceedings for MSR-2010 will be published electronically by the IEEE digital library. Additionally, attendees of MSR-2010 will be able to download an electronic version during the conference.