What Researchers Want - UMD Department of Computer Science

Download Report

Transcript What Researchers Want - UMD Department of Computer Science

What Researchers Want
Cody Dunne
Links from this talk:
Dept. of Computer Science and
Human-Computer Interaction Lab,
University of Maryland
[email protected]
bit.ly/stmwant
STM 3rd Master Class
November 7-9, 2011 Adelphi, MD, USA
1
Researchers want to…
1.
2.
3.
4.
Find a specific paper
Explore a research area
Do retrospective analysis
Share their results
2
1. Find a specific paper
• Metadata or PDF?
• From memory (search)
• From reference list
– DOI/URL
– Search
3
2. Exploring a research area
•
•
•
•
Foundations
Emerging research topics
State of the art/open problems
Collaborations & relationships between
Communities
• Field evolution
• Easily understandable surveys
4
User requirements
• Control over the paper collection
– Choose custom subset via query, then iteratively drill
down, filter, & refine
• Overview either as visualization or text statistics
– Orient within subset
• Easy to understand metrics for identifying
interesting papers
– Ranking & filtering
• Create groups & annotate with findings
– Organize discovery process
– Share results
5
Action Science Explorer
• Bibliometric lexical link mining to create a
citation network and citation context
• Network clustering and multi-document
summarization to extract key points
• Potent network analysis and visualization tools
www.cs.umd.edu/hcil/ase
6
7
Reference management & grouping
8
Citation network overview
Communities, outliers, invalid data
9
Statistics & visualization
• Network statistics
–
–
–
–
Degree
Betweenness
Closeness
Pagerank
• Attributes
–
–
–
–
Year
Downloads
Citations
References
10
Field evolution
11
Citation context &
summarization
• Citation context
– Key contributions
– Critical reception
– Citations to subsequent/similar work
• Hyperlinked citations in text
– See surrounding context of citation
– View cited papers while reading
• Multi-document summarization
– Citation context
– Abstract
– Full text
12
3. Retrospective analysis
• Automatic collection & processing of
bibliometric data
• Easy access to visual analytic tools for finding
clusters, trends, outliers
• Communities for sharing data, tools, & results
13
STICK Project
• Scientific, data-driven way to track innovations
– Vs. current expert-based, time consuming
approaches (e.g., Gartner’s Hype Cycle, tire track
diagrams)
• Includes both concept and product forms
– Study relationships between
• Study the innovation ecosystem
– Organizations & people
– Both those producing & using innovations
stick.ischool.umd.edu
14
Case study: tree visualization
• Problem: Traditional 2D node-link diagrams of
trees become too large
• Solutions:
– Treemaps: Nested Rectangles
– Cone Trees: 3D Interactive Animations
– Hyperbolic Trees: Focus + Context
• Measures:
– Papers, articles, patents, citations,…
– Press releases, blog posts, tweets,…
– Users, downloads, sales,…
15
Treemaps: nested rectangles
www.cs.umd.edu/hcil/treemap-history
16
Smartmoney MarketMap Feb 27, 2007
smartmoney.com/marketmap
17
Cone trees: 3D interactive animations
Robertson, G. G., Card, S. K., and Mackinlay, J. D., Information visualization using 3D interactive animation,
Communications of the ACM, 36, 4 (1993), 51-71.
Robertson, G. G., Mackinlay, J. D., and Card, S. K., Cone trees: Animated 3D visualizations of hierarchical information,
18
Proc. ACM SIGCHI Conference on Human Factors in Computing Systems, ACM Press, New York, (April 1991), 189-194.
Hyperbolic trees: focus & context
Lamping, J. and Rao, R., Laying out and visualizing large trees using a hyper-bolic space, Proc. 7th Annual ACM
symposium on User Interface Software and Technology, ACM Press, New York (1994), 13-14.
Lamping, J., Rao, R., and Pirolli, P., A focus+context technique based on hy-perbolic geometry for visualizing large
19
hierarchies, Proc. SIGCHI Conference on Human Factors in Computing Systems, ACM Press, New York (1995), 401-408.
TM=Treemaps
CT=Cone Trees
HT=Hyperbolic Trees
Patents
Academic
Papers
Trade Press
Articles
Case study: tree visualization impact
20
TM=Treemaps
CT=Cone Trees
HT=Hyperbolic Trees
Patents
Academic
Papers
Case study: tree visualization citations
21
Case study: business intelligence
Proquest News 2000-2009
Co-occurrence of concepts
with organizations
Data Mining
•
•
•
•
•
•
•
•
Year
National Security Agency
White House
FBI
AT&T
American Civil Liberties Union
Electronic Frontier Foundation
Dept. of Homeland Security
CIA
22
Business
Intelligence
2000-2009
Matrix
showing CoOccurrence
of concepts
and entities
23
Business
Intelligence
2000-2009:
(subset)
24
Business
Intelligence
2000-2009:
Data mining
•
•
•
•
•
•
•
•
•
•
•
NSA
CIA
FBI
White House
Pentagon
DOD
DHS
AT&T
ACLU
EFF
Senate Judiciar
Committee
25
Business
Intelligence
2000-2009:
Tech1
•
•
•
•
Google
Yahoo
Stanford
Apple
Tech2
• IBM, Cognos
• Microsoft
• Oracle
Finance
•
•
•
•
•
NASDAQ
NYSE
SEC
NCR
MicroStrategy
26
Business
Intelligence
2000-2009:
•
•
•
•
•
Air Force
Army
Navy
GSA
UMD*
27
STICK Process
• Identify concepts
• Query data sources
• Processing
• Automatic entity recognition
• Crowd-sourced verification
• Co-occurrence networks
• News
• Dissertation
• Academic
• Patent
• Visualizing & analyzing
• Overall statistics
• Co-occurrence networks
• Network evolution
• Blogs
• Sharing results
28
4. Sharing results
• Easily usable metadata (BibTeX, EndNote, etc.)
• Collaborative authoring
• Online communities
29
Collaborative literature reviews
• Organized references
• Annotated PDFs
www.mendeley.com
30
Shared data & analysis repositories
stick.ischool.umd.edu/community
31
Researchers want to…
1.
2.
3.
4.
Find a specific paper
Explore a research area
Do retrospective analysis
Share their results
32
What Researchers Want
Cody Dunne
Links from this talk:
Dept. of Computer Science and
Human-Computer Interaction Lab,
University of Maryland
[email protected]
bit.ly/stmwant
This work has been partially supported by
NSF grants IIS 0705832 (ASE) and
SBE 0915645 (STICK)
33