Cyberinfrastructure Opportunities at NSF
Download
Report
Transcript Cyberinfrastructure Opportunities at NSF
Water
EarthScope
Oceans Observations
Cyberinfrastructure in an Era of
Observation and Simulation
Eva Zanzerkia, NSF GEO/EAR
Environmental Obs
Satellites
Earth System Modeling
This Talk
Context at the National and Agency scale
CIF21
Data Management
What this means for CUAHSI
Framing the Challenge:
Science and Society Transformed by Data
Modern science
Multi-disciplinary
Collaborations for
Complexity
Data- and compute-intensive
Integrative, multiscale
Individuals, groups, teams,
communities
Sea of Data
Age of Observation
Distributed, central
repositories, sensor- driven,
diverse, etc
NSF ACCI: Innovation and New Paradigms
Multiple communities must engage rapidly,
integrate observations, expertise
o New multidisciplinary research communities and teams
o Teams notified of events by social networks, mobile
Simulations by global teams
New types of compute, data, collaboration and software
infrastructure
Comprehensive Computational and Data-
Enabled Science & Engineering (CDS&E)
Fundamental to support 21st century science and
engineering
New Budget Thrust for 2012: CIF 21
Broad Principles of CIF21
Comprehensive and integrated cyberinfrastructure to
transform research, innovation and education
Focus on computational and data-intensive science to
address complex problems
Increase of $117 million over FY 2010
enacted level
Four major components
Caveat: This is only the president’s budget
request to congress
CIF21: Four Thrust Areas
Organizations
Universities, schools
Government labs, agencies
Research and Medical
Centers
Libraries, Museums
Virtual Organizations
Communities
Community
Expertise
Research
Networks
Research and Scholarship
Education
Learning and Workforce
Development
Interoperability and operations
Cyberscience
Computational
Resources
Discovery
Collaboration
Education
Supercomputers
Clouds, Grids, Clusters
Visualization
Compute services
Data Centers
New Computational
Resources
Software
Applications, middleware
Software development and
support
Cybersecurity: access,
authorization, authentication
Scientific
Instruments
Large Facilities, MREFCs,telescopes,
Colliders, shake Tables, laboratory,
field deployment, Sensor Arrays
Data-Enabled
Science
Data
Databases, Data repositories
Collections and Libraries
Data Access; storage,
navigation
management, mining tools,
curation, privacy
Access and
Connections
to
Networking
Resources
Campus, national, international
networks
Research and experimental networks
End-to-end throughput
Cybersecurity
Data-Enabled Science
Data Services Program (data)
Provide reliable digital preservation, access, integration, and
analysis capabilities for science and/or engineering data over a
decades-long timeline
Data Analysis and Tools Program (information)
Data mining, manipulation, modeling, visualization, decisionmaking systems
Data-intensive Science Program (knowledge)
Intensive disciplinary efforts, multi-disciplinary discovery and
Observations
innovation
GIS
Climate
Models
Remote Sensing
New Computational Infrastructure
Computational and data-enabled resources
HPC, Clouds, Clusters, Data Centers
Long-term software for science and engineering
Sustained software development and support
Discipline-specific activities
Services, tools, compute environments that serve specific
research efforts and communities
Wyoming SCS
Access and Connectivity
Network connections and engineering program
Real-time access to facilities and instruments; Begins to tie in MREFC
activities
Integration and end-to-end performance to provide seamless access from
researcher to resource
Cybersecurity – from innovation to practice
Deployment of identity management systems
Development of cybersecurity prototypes
Community Research Networks
New multi-disciplinary research communities
Address challenges beyond individuals and disciplinary research
communities
Support and optimize collaboration across small, mid-level and large
community networks
Advanced research on community and social
networks
Structures, leadership, fostering and sustainability
“virtuous cycle” providing feedback through formal evaluation and program
iteration
CIF21 – Geosciences Foci
GEO plans to leverage substantial CI investments
Connect Well-established facilities: ES, OOI, IRIS, UNAVCO, IEDA, NCAR, CIG, CUAHSI,etc....
EAR Geoinformatics program
Participation in NSF-wide activities: PetaApps, SI2
Partnerships with other agencies and industry
Long-term plan: connect resources, science, users through National
cyberinfrastructure
New and enhanced computational platforms, tools, data centers to analyze, manipulate, visualize
and share large and complex data sets
a framework for open and easy access of all geoscience data and integrationg with other domains
Infrastructure and technology for retrieving and sharing observational data
Sustained training programs to create a workforce capable of multi-disciplinary science
EarthCube
GEO and OCI are working in partnership to support
the development of a geosciences wide
cyberinfrastructure
DCL: nsf11065
http://www.nsf.gov/pubs/2011/nsf11065/nsf11065.j
sp?WT.mc_id=USNSF_179
Webinar July 11, 2011.
Early Fall: charrette meeting to bring the geosciences
community together and focus a design
Prototypes (up to 3) will be supported in early Spring
Multiple Modes of Support Are Necessary for
EarthCube and CIF21
“Modes of support” that are essential to build CIF21
infrastructure and to engage in CIF21 activities.
Focused grants to individual PIs or small groups
Focused programs that are community driven
Small centers
Large national centers
Cyber-enhanced field programs
Cyber-enhanced observing facilities and MREFC projects
NSF-wide initiatives
Education, outreach, and training activities (EOT)
EarthCube
Modes of
Support
Access and Connection
New
Computational
Infrastructure
Networks
Data-Enabled Science
10-15 Year Timeline
CIF21 and CUAHSI
CIF21 and EarthCube is still being developed. There is the opportunity
to impact implementation through participation.
CIF21 dovetail with CUAHSI’s Strategic Plan:
Observation and Synthesis
improving accessibility to observational networks
data discovery and community modeling
Data Access
maintain services for diverse data
develop citation and tracking mechanisms
emerging data types and needs
outreach for standards
What are the capabilities that water sciences researchers will need in
the future?
Where can CI Enhance Partnerships?
Connect observing systems
Connect to other domains and resources:
DataNet; computational facilities; synthesis centers
Data Management Plans
Long-standing policy states NSF’s expectations with respect
to sharing of data and other research products:
Grant Conditions: “NSF expects investigators to share with
other researchers, at no more than incremental cost and within a
reasonable time, the data, samples, physical collections and other
supporting materials created or gathered in the course of the
work. It also encourages grantees to share software and
inventions or otherwise act to make the innovations they embody
widely useful and usable.”
Data Management Plan
All proposals must include, as a supplementary doc up to
2 pages, a Data Management Plan.
Plan should describe how the proposal will conform to
NSF policy on dissemination and sharing of research
results.
Plan will be reviewed as part of the intellectual merit
and/or broader impacts of the proposal depending on the
proposal intent.
FastLane will not permit submission of a proposal that is
missing a data management plan.
What’s Included?
o Anticipated data, samples, physical collections, software,
curriculum materials, anything produced during the reseach.
o Information on analytical standards, metadata, solutions or
remedies where these do not exist.
o Plans for data/sample access and sharing, including provision for
protection of privacy/confidentiality/ security, intellectual property,
or other rights, if appropriate.
o Plans for re-use, re-distribution, and production of derived data
sets.
o Plans for archiving/preserving data, samples, and other research
products and providing access to others.
Data Management Plan Resources
http://www.nsf.gov/geo/geo-data-policies/index.jsp
Data Policies and Community Standards
Templates
Review of DMP is an evolving process. More
guidance on review will be forthcoming.
Opportunities for CUAHSI
CIF21: Integration and Partnerships
Take advantage of resources
Wyoming Supercomputing Center
Powell Center
TeraGrid (XD)
DataNets: DataOne, Data Conservancy
Upcoming Competitions
SI2: NSF 11-539 July 18, 2011
EAR Geoinformatics (?): Will be tied to CIF21 and reports from recent
Geoinformatics WS
Data management plans are an opportunity
Set community standards for peer review
Provide resources for a broad community
Articulate your science needs and tools
Be flexible: technology advance outpaces integration into science;
Unanticipated Scientific Discovery is still allowed!