The GroupLens lab was one of the first to study automated recommender systems with the construction of the "GroupLens" recommender, a Usenet article recommendation engine, and MovieLens, a popular movie recommendation site used to study recommendation engines, tagging systems, and user interfaces. The lab has also gained notability for its members' work studying open content communities such as Cyclopath, a geo-wiki that was used in the Twin Cities to help plan the regional cycling system.[1][2]
In 1992, John Riedl and Paul Resnick attended the
CSCWconference
together. After they heard keynote speaker Shumpei Kumon talk
about his vision for an information economy,[3] they began working on a collaborative filtering
system for Usenet news. The system collected ratings from Usenet readers and used those ratings to predict how much other readers would like an article before they read it. This recommendation engine was one of the first automated collaborative filtering systems in which algorithms were used to automatically form predictions based on historical patterns of ratings.[4] The overall system was called the "GroupLens" recommender, and the servers that collected the ratings and performed the computation were called the "Better Bit Bureau". This name was later dropped after a request from the Better Business Bureau. "GroupLens" is now used as a name both for this recommender system, and for the research lab at the University of Minnesota.
A feasibility test was done between MIT and
the University of Minnesota and a research paper was published including
the algorithm, the system design, and the results of the feasibility
study, in the CSCW conference of 1994.[5]
In 1993, Riedl and Resnick invited Joseph Konstan to join the
team. Together, they decided to create a higher-performance
implementation of the algorithms to support larger-scale deployments.
In summer 1995 the team gathered
Bradley Miller, David Maltz,
Jon Herlocker, and Mark Claypool for "Hack Week" to create
the new implementation, and to plan the next round of experiments.[6]
In the Spring of 1996, the first workshop on
collaborative filtering was put together by Resnick and
Hal Varian at the University of California, Berkeley.[7]
There, researchers from projects around the US
that were studying similar systems came together to share ideas and
experience.
In the summer of 1996, David Gardiner, a former Ph.D. student of Riedl's, introduced John Riedl to Steven Snyder. Snyder had been an early employee at Microsoft, but left Microsoft to come to Minnesota to do a Ph.D. in Psychology. He realized the commercial potential of collaborative filtering, and encouraged the team to found a company in April 1996. By June, Gardiner, Snyder, Miller, Riedl, and Konstan had incorporated their company, and by July they had their first round of funding, from Hummer Winblad Venture Partnersventure capital company.[8] Net Perceptions went on to be one of the leading companies in
personalization during the Internet boom of the late 1990s, and stayed
in business until 2004.[9][10] Based on their experience, Riedl and Konstan wrote a book about the lessons learned from deploying recommenders in practice.[11] Recommender systems have since become ubiquitous in the online world, with leading vendors such as Amazon and Netflix deploying highly sophisticated recommender systems.[12] Netflix even offered a $1 million prize for improvements in recommender technology.[13]
When
the EachMovie[14] site closed in 1997, the researchers behind it released
the anonymous rating data they had collected, for other researchers
to use. The GroupLens Research team, led by Brent Dahlen and Jon
Herlocker, used this data set to jumpstart a new movie recommendation
site called MovieLens which has been a very visible research platform, including a detailed discussion in a New Yorker article by
Malcolm Gladwell,[15] and a report in a full episode of ABC Nightline.[16]
Between 1997 and 2002 the group continued its research on
collaborative filtering, which became known in the community by the
more general term of recommender systems. With Joe Konstan's expertise in user interfaces,[17][18]
the team began exploring interface issues in recommenders, such as explanations,[19] and meta-recommendation systems.[20]
Brent Hecht joined the GroupLens faculty in 2013, focusing on geographic human-computer interaction. Lana Yarosh joined the GroupLens faculty in 2014; she works with social computing and child-computer interaction. A third professor, Haiyi Zhu, joined in 2015. Haiyi has published research on Facebook and other social networks. Stevie Chancellor, a human-centered computing and social computing researcher, joined the GroupLens faculty in 2020.[30]
The MovieLens recommender system:MovieLens is a non-commercial movie recommender system that has been running since 1997 with over 164,000 unique visitors as of 2009, who have provided over 15 million movie ratings.[31]
MovieLens ratings datasets: In the early days of recommender systems, research was slowed down by the lack of publicly available datasets. In response to requests from other researchers, GroupLens released three datasets:[32] the MovieLens 100,000 rating dataset, the MovieLens 1 million rating dataset, and the MovieLens 10 million rating dataset. These datasets became the standard datasets for recommender research, and have been used in over 300 papers by researchers around the world. The dataset is also being used for teaching about recommender technology.[33]
MovieLens tagging dataset: GroupLens added tagging to MovieLens in 2006. Since then, users have provided over 85,000 applications of 14,000 unique tags to movies.[34] The MovieLens 10 million ratings dataset also includes a 100,000 tag applications dataset for researchers to use.
Information leakage from recommender datasets: a paper in the information retrieval conference analyzed the privacy risks to users of having large recommender datasets released. The basic risk discovered is that an anonymized dataset might be combined with public information to identify a user. For instance, a user who has written about his preference for movies on online forums could be associated with a specific row in the MovieLens datasets. In some cases, these associations might leak information the user would prefer to keep private.[35][36]
Wikipedia research: The study of value and vandalism in Wikipedia published in 2007[37] described the concentration of contribution across Wikipedia editors. This paper was one of the first to focus on the length of time that a contribution survives within Wikipedia as a measure of its value. The paper also investigated the effects of vandalism on Wikipedia readers, by measuring the probability that a view of a page would capture that page in a vandalized state. GroupLens has also explored ways to help editors find pages which they can effectively contribute to with the SuggestBot recommender.[38] The group has also explored the evolution of the norms in Wikipedia that determine which articles are accepted or rejected, and the effect of changes in those norms on the Long Tail of Wikipedia articles.[39] GroupLens has also explored the functioning of the informal peer review system within Wikipedia to discover ways the decisions being made appear to be influenced inappropriately by ownership, and that experience does not seem to change editor performance very much.[40][41] GroupLens researchers have also explored visualizations of the edit history of Wikipedia articles.[42] In 2011, the GroupLens researchers completed a scientific exploration of gender imbalance in Wikipedia's popular editors, resulting in finding that there was a large gap between male and female editors.[43]
Shilling recommender systems: GroupLens has explored ways that users of recommender systems can attempt to inappropriately influence the recommendations given to other users.[44] They call this behavior shilling, because of its relationship to the practice of hiring associates to pretend to be enthusiastic customers. They showed that some types of shilling are likely to be effective in practice. One concern about shilling is that the false predictions may change the reported opinions of later users, further corrupting the recommendations.[45][46]
Cyclopath: Beginning in 2008, GroupLens launched Cyclopath, a computational geo-wiki for local bicyclists. Cyclopath has since been used by hundreds of cyclists within the Twin Cities.[47] More recently, Cyclopath has been adopted by the Twin Cities Metropolitan Council to help plan the regional cycling system.[1]
^
Kumon, Shumpei (1992). "From wealth to wisdom: a change in the social paradigm". Proceedings of the 1992 ACM conference on Computer-supported cooperative work. Computer Supported Cooperative Work. ACM Press. p. 3. doi:10.1145/143457.371587. ISBN0-89791-542-9.
^
Resnick, Paul; Iacovou, Neophytos; Suchak, Mitesh; Bergstrom, Peter; Riedl, John (1994). "GroupLens: an open architecture for collaborative filtering of netnews". Proceedings of the 1994 International ACM Conference on Computer Supported Cooperative Work. Computer Supported Cooperative Work. ACM Press. pp. 175–186. doi:10.1145/192844.192905. ISBN0-89791-689-1.
^
Borchers, Al; Herlocker, Jon; Konstan, Joseph; Riedl, John (April 1998). "Ganging up on Information Overload". Computer. 31 (4): 106–108. doi:10.1109/2.666847. ISSN0018-9162.
^
Riedl, John; Konstan, Joseph; Vrooman, Eric (August 2002). Word of Mouse: The Marketing Power of Collaborative Filtering. Grand Central. ISBN978-0-7595-2727-0.
^
Zeitchik, Steven (July 18, 2008). "Technology Gets Personal". The Wall Street Journal. p. W9. Retrieved 2009-12-23.