This document summarizes a panel discussion on building the National Research Platform ecosystem with regional networks. The panelists discussed how their regional networks are connecting to and using the Nautilus nodes of the NRP. Examples included using NRP for deep learning and computer vision research at the University of Missouri, challenges of adoption in Nevada and potential solutions, and Georgia Tech's new involvement through the Southern Crossroads regional network. The regional networks see opportunities to expand NRP access and training to enable more researchers in their regions to take advantage of the platform.
Report
Share
Report
Share
1 of 19
More Related Content
Similar to Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
High-level Meeting & Workshop on Environmental and Scientific Open Data for Sustainable Development Goals in Developing Countries. Madagascar, 4-6 December 2017
Supermicro designed and implemented a rack-level cluster solution for San Diego Supercomputing Center (SDSC) optimized for their custom and experimental AI training and inferencing workloads, and meeting their environmental and TCO requirements. The project team will discuss the journey of designing and deploying our Rack Plug and Play cluster, and Shawn Strande, Dupty Director, SDSC, will be sharing his experience of partnering with the Supermicro team to solve his challgenges in HPC and AI.
The team will also share the technology that powers the SDSC Voyager Supercomputer, the Habana Gaudi AI system with 3rd Gen Intel® Xeon® Scalable processors for Deep Learning Training, and Habana Goya for Inferencing.
Watch the webinar: https://www.brighttalk.com/webcast/17278/517013
In this deck from the 2019 Stanford HPC Conference, Nik Nystrom from the Pittsburgh Supercomputing Center presents: Pioneering and Democratizing Scalable HPC+AI.
"PSC's Bridges was the first system to successfully converge HPC, AI, and Big Data. Designed for the U.S. national research community and supported by NSF, Bridges now serves approximately 1800 projects and 7500 users at 380 institutions, and it is the foundation around which new HPC+AI projects have launched. Bridges emphasizes "nontraditional" uses that span the life, physical, and social sciences, computer science, engineering, business, and humanities. Scalable HPC+AI is driving many of those applications, which span diverse topics such as learning root causes of cancer, strategic reasoning, designing new materials, predicting severe storms, recognizing speech including contextual information, and detecting objects in 4k streaming video. To address the demand for scalable AI, PSC recently introduced Bridges-AI, which adds transformative new AI capability. In this presentation, we share our vision in designing HPC+AI systems at PSC and highlight some of the exciting research breakthroughs they are enabling."
Nick Nystrom is Interim Director and Sr. Director of Research at the Pittsburgh Supercomputing Center (PSC). Nick is architect and PI for Bridges, PSC's flagship system that successfully pioneered the convergence HPC, AI, and Big Data. He is also PI for the NIH Human Biomolecular Atlas Program’s HIVE Infrastructure Component and co-PI for projects that bring emerging AI technologies to research (Open Compass), apply machine learning to biomedical data for breast and lung cancer (Big Data for Better Health), and identify causal relationships in biomedical big data (the Center for Causal Discovery, an NIH Big Data to Knowledge Center of Excellence). His current research interests include hardware and software architecture, applications of machine learning to multimodal data (particularly for the life sciences) and to enhance simulation, and graph analytics.
Watch the video: https://youtu.be/ucRs4A_afus
Learn more: https://www.psc.edu/bridges
and
http://hpcadvisorycouncil.com/events/2019/stanford-workshop/
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
This document summarizes a kick-off meeting for the UR3 project, which aims to implement a cloud computing infrastructure for sharing data, algorithms, and high performance computing resources among different teams and communities. It outlines the objectives, tasks, timeline, and involved partners of the UR3 project. It also discusses concepts for the cloud architecture, including virtualization, horizontal and vertical scalability, and the benefits of a cloud model for optimizing resource usage and reducing costs.
RNP is the national advanced network for higher education, research, and innovation institutions in Brazil. It connects over 600 organizations and 1,500 campuses across Brazil. RNP developed a software defined infrastructure (SDI) composed of an overlay SDN and nationwide distributed edge cloud with an orchestration platform to provision remote computing and networking resources using open source solutions. The SDI supports evolving RNP's existing applications and services to a hybrid cloud model and allows new methods of validation, testing, and provisioning. Current activities with the SDI include improving network and application architectures, automation, security, and advanced services like CDN and video delivery.
This document summarizes a research project on applying nanotechnology to cognitive radio and electronic warfare. It outlines the project members and faculty leads, problem statement, approach, and schedule. The problem aims to identify how nanotechnology can address spectrum sensing requirements for cognitive radio. The approach includes a literature review on integrating nanotechnology into wireless networks and studying the role of GPU technology in nanocomputing. Students conducted this research and a workshop on robotics and cybersecurity for high school students.
Academic research institutions are at a precipice. They have historically been constrained to supporting classic “job” style workloads. With the growth of new workflow practices such as streaming data, science gateways, and more “dynamic” research using lambda-like functions, they must now support a variety of workloads.
In this talk, Lindsey and Bob will discuss some difficulties faced by academic institutions and how Kubernetes offers an extensible solution to support the future of research. They will present a selection of projects currently benefiting from Kubernetes enabled tools, like Argo, Kubeflow, and kube-batch. These workflows will be demonstrated using specific examples from two large research institutions: Compute Canada, Canada’s national computation research consortium and the University of Michigan, one of the largest public Universities in the United States.
KubeCon EU 2019
AusCover Earth Observation Services and Data CubesTERN Australia
The presentation provides an overview of earth observation services offered by AusCover Facility of TERN. The presentation was part of the Workshop on Approaches to Terrestrial Ecosystem Data Management : from collection to synthesis and beyond which was held on 9th of March 2016 in University of Queensland.
Democratizing machine learning on kubernetesDocker, Inc.
One of the largest challenges facing the machine learning community today is understanding how to build a platform to run common open-source machine learning libraries such as Tensorflow. Both Joy and Lachie are both passionate about making machine learning accessible to the masses using Kubernetes. In this session they'll share how to deploy a distributed Tensorflow training cluster complete with GPU scheduling on Kubernetes. We'll also share how distributed Tensorflow training works, various options for distributed training, and when to choose what option. We'll also share some best practices on using distributed Tensorflow on top of Kubernetes, based on our latest performance tests performed on public cloud providers. All work presented in this session will be accessible via a public Github repository.
Synergy 2014 - Syn122 Moving Australian National Research into the CloudCitrix
The document summarizes the National Servers Project (NSP) which provides a cloud platform for hosting core services for Australian researchers. The NSP aims to provide a reliable, secure environment for researchers to host important applications so they can focus on managing their research. It highlights how the NSP was implemented including building the management stack and offering self-service features to allow researchers to control virtual machines. Several national research projects are using the NSP including IMOS, Quadrant and TERN.
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube
This webinar features project overviews of all EarthCube Awards (Building Blocks, Research Coordination Networks, Conceptual Designs, and Test Governance), followed by a call for involvement, and a Q&A session.
Agenda:
EarthCube Awards – Project Overviews
1.. EarthCube Web Services (Building Block)
2. EC3: Earth-Centered Community for Cyberinfrastructure (RCN)
3. GeoSoft (Building Block)
4. Specifying and Implementing ODSIP (Building Block)
5. A Broker Framework for Next Generation Geoscience (BCube) (Building Block)
6. Integrating Discrete and Continuous Data (Building Block)
7. EAGER: Collaborative Research (Building Block)
8. A Cognitive Computer Infrastructure for Geoscience (Building Block)
9. Earth System Bridge (Building Block)
10. CINERGI – Community Inventory of EC Resources for Geoscience Interoperability (BB)
11. Building a Sediment Experimentalist Network (RCN)
12. C4P: Collaboration and Cyberinfrastructure for Paleogeosciences (RCN)
13. Developing a Data-Oriented Human-centric Enterprise for Architecture (CD)
14. Enterprise Architecture for Transformative Research and Collaboration (CD)
15. EC Test Enterprise Governance: An Agile Approach (Test Governance)
A Call for Involvement!
Looking Back, Looking Forward NSF CI Funding 1985-2025Larry Smarr
This document provides an overview of the development of national research platforms (NRPs) from 1985 to the present, with a focus on the Pacific Research Platform (PRP). It describes the evolution of the PRP from early NSF-funded supercomputing centers to today's distributed cyberinfrastructure utilizing optical networking, containers, Kubernetes, and distributed storage. The PRP now connects over 15 universities across the US and internationally to enable data-intensive science and machine learning applications across multiple domains. Going forward, the document discusses plans to further integrate regional networks and partner with new NSF-funded initiatives to develop the next generation of NRPs through 2025.
This a RECAP project overview slide deck prepared by Thang Le Duc (UMU), P-O Östberg (UMU) and Tomas Brännström (Tieto). It starts with an introduction and continues with a section on challenges for a self-orchestrated, self-remediated cloud system. It then presents the RECAP vision and use cases and finishes with a conclusion.
- The Pacific Research Platform (PRP) interconnects campus DMZs across multiple institutions to provide high-speed connectivity for data-intensive research.
- The PRP utilizes specialized data transfer nodes called FIONAs that provide disk-to-disk transfer speeds of 10-100Gbps.
- Early applications of the PRP include distributing telescope data between UC campuses, connecting particle physics experiments to computing resources, and enabling real-time wildfire sensor data analysis.
This document summarizes a new cloud-based platform for analyzing large satellite Earth observation (EO) data. Key points:
- Traditional desktop tools cannot handle the growing size of EO data, requiring a shift to cloud-native analytics using big data tools.
- The demonstrated platform brings algorithms to distributed data using open-source tools like Kubernetes, Docker, and Jupyter notebooks running on Google Cloud.
- It provides scalable infrastructure for processing and analyzing petabytes of EO data through pipelines, supports open data standards, and ensures data sovereignty with Canadian storage.
The document discusses Internet2, an advanced networking consortium that operates a 15,000 mile fiber optic network for research and education. It provides very high speed connectivity and collaboration technologies to facilitate large data sharing and frictionless research. Examples are given of life sciences projects utilizing Internet2's high-speed network for genomic research and agricultural applications involving terabytes of satellite and sensor data. The network is expanding to include cloud computing resources and supercomputing centers to enable global-scale distributed scientific computing and collaboration.
Giles Sirett, CEO of ShapeBlue, will be giving an outline of what everyone can expect for the day. Join the official event opening to learn more for the upcoming sessions, speakers and networking events. Discover also the latest news from the CloudStack Community!
-----------------------------------------
The CloudStack European User Group 2022 took place on 7th April. The day saw a virtual get together for the European CloudStack Community, hosting 265 attendees from 25 countries. The event hosted 10 sessions with from leading CloudStack experts, users and skilful engineers from the open-source world, which included: technical talks, user stories, new features and integrations presentations and more.
------------------------------------------
About CloudStack: https://cloudstack.apache.org/
EPOC and NetSage provide engagement and network monitoring services to support research and education. NetSage collects anonymized network flow data to help understand traffic patterns and troubleshoot performance issues. It provides dashboards and analysis to answer common questions from network engineers and end users. Examples of NetSage deployments and use cases were shown for the CENIC network, including top sources and destinations of traffic, debugging slow flows, and analyzing international traffic patterns by country over time.
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...SURFnet
This document discusses using dedicated servers called data transfer nodes (DTNs) to improve data transfer speeds between research institutions. DTNs are part of a network architecture called a Science DMZ that optimizes high-speed transfers. The document recommends:
- Deploying high-performance DTNs with fast storage in a separate network zone dedicated to research data and services.
- Configuring lossless connections and security policies that don't impede transfers between DTNs and research networks.
- Educating IT departments on maintaining and supporting the infrastructure to improve end-user performance for data-intensive research collaborations.
The Rise of Supernetwork Data Intensive ComputingLarry Smarr
Invited Remote Lecture to SC21
The International Conference for High Performance Computing, Networking, Storage, and Analysis
St. Louis, Missouri
November 18, 2021
My Remembrances of Mike Norman Over The Last 45 YearsLarry Smarr
Mike Norman has been a leader in computational astrophysics for over 45 years. Some of his influential work includes:
- Cosmic jet simulations in the early 1980s which helped explain phenomena from galactic centers.
- Pioneering the use of adaptive mesh refinement in the 1990s to achieve dynamic load balancing on supercomputers.
- Massive cosmology simulations in the late 2000s with over 100 trillion particles using thousands of processors across multiple supercomputing sites, producing petabytes of data.
- Developing end-to-end workflows in the 2000s to couple supercomputers, high-speed networks, and large visualization systems to enable real-time analysis of extremely large astrophysics simulations.
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Larry Smarr
Larry Smarr discusses quantifying his body and health over time through extensive self-tracking. He measures various biomarkers through regular blood tests and analyzes his gut microbiome by sequencing stool samples. This revealed issues like chronic inflammation and an unhealthy microbiome. Smarr then took steps like a restricted eating window and increasing plant diversity in his diet, which reversed metabolic syndrome issues and correlated with shifts in his microbiome ecology. His goal is to continue precisely measuring factors like toxins, hormones, gut permeability and food/supplement impacts to further optimize his health.
Panel: Reaching More Minority Serving InstitutionsLarry Smarr
This document discusses engaging more minority serving institutions (MSIs) in cyberinfrastructure development through regional networks. It provides data showing the importance of MSIs like historically black colleges and universities (HBCUs) in educating underrepresented minority students in STEM fields. Regional networks can help equalize opportunities by assisting MSIs in overcoming barriers to resources through training, networking infrastructure support, and helping institutions obtain necessary staffing and funding. Strategies mentioned include collaborating with MSIs on grants and addressing issues identified in surveys like lack of vision for data use beyond compliance. The goal is to broaden participation in STEAM fields by leveraging the success MSIs have shown in supporting underrepresented students.
Global Network Advancement Group - Next Generation Network-Integrated SystemsLarry Smarr
This document summarizes a presentation on global petascale to exascale workflows for data intensive sciences. It discusses a partnership convened by the GNA-G Data Intensive Sciences Working Group with the mission of meeting challenges faced by data-intensive science programs. Cornerstone concepts that will be demonstrated include integrated network and site resource management, model-driven frameworks for resource orchestration, end-to-end monitoring with machine learning-optimized data transfers, and integrating Qualcomm's GradientGraph with network services to optimize applications and science workflows.
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...Larry Smarr
This document discusses opportunities for ESnet to support wireless edge computing through developing a strategy around self-guided field laboratories (SGFL). It outlines several potential science use cases that could benefit from wireless and distributed computing capabilities, both in the short term through technologies like 5G, LoRa and Starlink, and longer term through the vision of automated SGFL. The document proposes some initial ideas for deploying and testing wireless edge computing technologies through existing projects to help enable the SGFL vision and further scientific opportunities. It emphasizes that exploring these emerging areas could help drive new science possibilities if done at a reasonable scale.
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonLarry Smarr
This document provides an overview of Asia Pacific and Korea research platforms. It discusses the Asia Pacific Research Platform working group in APAN, including its objectives to promote HPC ecosystems and engage members. It describes the Asi@Connect project which provides high-capacity internet connectivity for research across Asia-Pacific. It also discusses the Korea Research Platform and efforts to expand it to 25 national research institutes in Korea. New related projects on smart hospitals, agriculture, and environment are mentioned. The conclusion discusses enhancing APAN and the Korea Research Platform and expanding into new areas like disaster and AI education.
BT & Neo4j: Knowledge Graphs for Critical Enterprise Systems.pptx.pdfNeo4j
Presented at Gartner Data & Analytics, London Maty 2024. BT Group has used the Neo4j Graph Database to enable impressive digital transformation programs over the last 6 years. By re-imagining their operational support systems to adopt self-serve and data lead principles they have substantially reduced the number of applications and complexity of their operations. The result has been a substantial reduction in risk and costs while improving time to value, innovation, and process automation. Join this session to hear their story, the lessons they learned along the way and how their future innovation plans include the exploration of uses of EKG + Generative AI.
The DealBook is our annual overview of the Ukrainian tech investment industry. This edition comprehensively covers the full year 2023 and the first deals of 2024.
Fluttercon 2024: Showing that you care about security - OpenSSF Scorecards fo...Chris Swan
Have you noticed the OpenSSF Scorecard badges on the official Dart and Flutter repos? It's Google's way of showing that they care about security. Practices such as pinning dependencies, branch protection, required reviews, continuous integration tests etc. are measured to provide a score and accompanying badge.
You can do the same for your projects, and this presentation will show you how, with an emphasis on the unique challenges that come up when working with Dart and Flutter.
The session will provide a walkthrough of the steps involved in securing a first repository, and then what it takes to repeat that process across an organization with multiple repos. It will also look at the ongoing maintenance involved once scorecards have been implemented, and how aspects of that maintenance can be better automated to minimize toil.
YOUR RELIABLE WEB DESIGN & DEVELOPMENT TEAM — FOR LASTING SUCCESS
WPRiders is a web development company specialized in WordPress and WooCommerce websites and plugins for customers around the world. The company is headquartered in Bucharest, Romania, but our team members are located all over the world. Our customers are primarily from the US and Western Europe, but we have clients from Australia, Canada and other areas as well.
Some facts about WPRiders and why we are one of the best firms around:
More than 700 five-star reviews! You can check them here.
1500 WordPress projects delivered.
We respond 80% faster than other firms! Data provided by Freshdesk.
We’ve been in business since 2015.
We are located in 7 countries and have 22 team members.
With so many projects delivered, our team knows what works and what doesn’t when it comes to WordPress and WooCommerce.
Our team members are:
- highly experienced developers (employees & contractors with 5 -10+ years of experience),
- great designers with an eye for UX/UI with 10+ years of experience
- project managers with development background who speak both tech and non-tech
- QA specialists
- Conversion Rate Optimisation - CRO experts
They are all working together to provide you with the best possible service. We are passionate about WordPress, and we love creating custom solutions that help our clients achieve their goals.
At WPRiders, we are committed to building long-term relationships with our clients. We believe in accountability, in doing the right thing, as well as in transparency and open communication. You can read more about WPRiders on the About us page.
RPA In Healthcare Benefits, Use Case, Trend And Challenges 2024.pptxSynapseIndia
Your comprehensive guide to RPA in healthcare for 2024. Explore the benefits, use cases, and emerging trends of robotic process automation. Understand the challenges and prepare for the future of healthcare automation
Blockchain technology is transforming industries and reshaping the way we conduct business, manage data, and secure transactions. Whether you're new to blockchain or looking to deepen your knowledge, our guidebook, "Blockchain for Dummies", is your ultimate resource.
Comparison Table of DiskWarrior Alternatives.pdfAndrey Yasko
To help you choose the best DiskWarrior alternative, we've compiled a comparison table summarizing the features, pros, cons, and pricing of six alternatives.
TrustArc Webinar - 2024 Data Privacy Trends: A Mid-Year Check-InTrustArc
Six months into 2024, and it is clear the privacy ecosystem takes no days off!! Regulators continue to implement and enforce new regulations, businesses strive to meet requirements, and technology advances like AI have privacy professionals scratching their heads about managing risk.
What can we learn about the first six months of data privacy trends and events in 2024? How should this inform your privacy program management for the rest of the year?
Join TrustArc, Goodwin, and Snyk privacy experts as they discuss the changes we’ve seen in the first half of 2024 and gain insight into the concrete, actionable steps you can take to up-level your privacy program in the second half of the year.
This webinar will review:
- Key changes to privacy regulations in 2024
- Key themes in privacy and data governance in 2024
- How to maximize your privacy program in the second half of 2024
Quantum Communications Q&A with Gemini LLM. These are based on Shannon's Noisy channel Theorem and offers how the classical theory applies to the quantum world.
Mitigating the Impact of State Management in Cloud Stream Processing SystemsScyllaDB
Stream processing is a crucial component of modern data infrastructure, but constructing an efficient and scalable stream processing system can be challenging. Decoupling compute and storage architecture has emerged as an effective solution to these challenges, but it can introduce high latency issues, especially when dealing with complex continuous queries that necessitate managing extra-large internal states.
In this talk, we focus on addressing the high latency issues associated with S3 storage in stream processing systems that employ a decoupled compute and storage architecture. We delve into the root causes of latency in this context and explore various techniques to minimize the impact of S3 latency on stream processing performance. Our proposed approach is to implement a tiered storage mechanism that leverages a blend of high-performance and low-cost storage tiers to reduce data movement between the compute and storage layers while maintaining efficient processing.
Throughout the talk, we will present experimental results that demonstrate the effectiveness of our approach in mitigating the impact of S3 latency on stream processing. By the end of the talk, attendees will have gained insights into how to optimize their stream processing systems for reduced latency and improved cost-efficiency.
Transcript: Details of description part II: Describing images in practice - T...BookNet Canada
This presentation explores the practical application of image description techniques. Familiar guidelines will be demonstrated in practice, and descriptions will be developed “live”! If you have learned a lot about the theory of image description techniques but want to feel more confident putting them into practice, this is the presentation for you. There will be useful, actionable information for everyone, whether you are working with authors, colleagues, alone, or leveraging AI as a collaborator.
Link to presentation recording and slides: https://bnctechforum.ca/sessions/details-of-description-part-ii-describing-images-in-practice/
Presented by BookNet Canada on June 25, 2024, with support from the Department of Canadian Heritage.
Support en anglais diffusé lors de l'événement 100% IA organisé dans les locaux parisiens d'Iguane Solutions, le mardi 2 juillet 2024 :
- Présentation de notre plateforme IA plug and play : ses fonctionnalités avancées, telles que son interface utilisateur intuitive, son copilot puissant et des outils de monitoring performants.
- REX client : Cyril Janssens, CTO d’ easybourse, partage son expérience d’utilisation de notre plateforme IA plug & play.
論文紹介:A Systematic Survey of Prompt Engineering on Vision-Language Foundation ...Toru Tamaki
Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip Torr "A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models" arXiv2023
https://arxiv.org/abs/2307.12980
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
1. Slide 1
February 9, 2023
Slide 1
February 9, 2023
Building the NRP Ecosystem with the Regional
Networks on their Campuses
4th National Research Platform Workshop
February 9, 2023
4. Slide 4
February 9, 2023
www.thequilt.net
Panelists
• Grant Scott, Director, Data Science and Analytics Masters Program,
University of Missouri
• Scotty Strachan, Principal Research Engineer, NevadaNet
• Eric Buckhalt, Senior Director, Technology Strategy, Architecture, &
Governance, Georgia Tech
5. Using NRP @ University of Missouri (Mizzou)
Grant Scott
Assistant Professor
• Computer Science, Computer
Engineering
• College of Engineering
Director,
• Data Science and Analytics, MS
Program
• Institute for Data Science and
Informatics
Provided Data Science and Computer
Science Learning Outreach using MU & NRP
Jupyter
• US Government Intelligence Agency
• USDA ARS Long-Term Agroecosystem
Research (LTAR) Data Managers
• Tutorials at Regional and International
conferences
https://scottgs.mufaculty.umsystem.edu/
Evangelizing the use of Nautilus / NRP for research
computing and STEM education
6. Using Nautilus For Research @ MU
Grant Scott with mentees Alex Hurt, Anes Ouadou
Case Study in Deep Learning for
Computer Vision
• Journal Publication Results in Weeks or
days
• Using State-of-the-Art Deep Learning
Computer Vision Models for Satellite
Imagery Data Sets
Assisting numerous other researchers
using ML & Deep Learning
• Protein Structure Prediction
• Robotics
• UAS Navigation & Vision
Tutorials & Workshops
• Great Plains Network Annual Meeting 2022
• Getting Started on Nautilus and Kubernetes
• SC 22 – GPN Booth
• Scaling Deep Learning with Nautilus
• Great Plains Network Annual Meeting 2023
• Getting Started on Nautilus and Kubernetes
• Scaling Deep Learning with Nautilus
7. Scaling Deep Learning with Kubernetes on Nautilus
• Using containerized model definition and list of
jobs
• Mounted persistent data storage to each pod
• Each GPU job produces an associated trained
model
• Automation currently performed via environment
variables and bash, but more sophisticated
methods in development
• Models are sync’d to Nautilus S3 bucket for later
use in evaluation or other ML applications
Dr. Alex Hurt
Nautilus
for
Accelerated
Research
Computing
8. Deep Learning on Nautilus: By the Numbers
Compute Intensive
• Containerized deep neural architectures: 9
• Datasets trained on: 3
• PyTorch Models Trained: 27
• Training Epochs Completed: 8,100
• Iterations of Training Completed: 30,088,125
• Number of Images Processed: 240,705,000
• Trainable Parameters Optimized:
1,730,368,875 (1.73 Billion)
Data Intensive
• Data loading: 415.8GB
• Neural Model Loading:
124,740 GB
• Wall-Clock: ~77 days
• Human Effort: <3 hours
Dr. Alex Hurt
9. Deep Learning on Nautilus: By the Numbers
Compute Intensive
• Containerized deep neural architectures: 4
• Datasets trained on: 1 – Forest Fire Mapping
• PyTorch Models Trained: 144
• Training Epochs Completed: 14,400
• Iterations of Training Completed: 515,550
• Number of Images Processed: 7,070,400
• Trainable Parameters Optimized: ~23 million
Optimizing time to science
discoveries
• Wall-Clock: ~22 days
• Human Effort: ~13 hours
Anes Ouadou (PhD cand.)
10. Nautilus Supports Jupyter Hub & Jupyter Lab
GEER Excels
• Basic Python Programming
• Used Nautilus Jupyter Lab
• Trainees authenticated with
GitHub accounts
Basic Programming Course
Parallel Programming for High-
Performance Computing
• Multi-Proc, Multi-Thread
• MPI, CUDA
Design and Analysis of Algorithms
• C/C++ Programming
• Analysis with Symbolic Python
• Run-time empirical analysis with Python
Computer Science Courses
Parallel Data Analytics
• Kubernetes
Data Science Courses
11. CC* Team: Great Plains Regional CyberTeam
• PI NSF Award OAC #1925681
• Helping the Great Plains region better leverage
collective cyberinfrastructure resources
• GPN Contributions to Nautilus and the NRP
• GPN developing a CC* Regional Compute proposal,
GP-ENGINE, to deploy NRP nodes into 8 locations
across 6 states
• Continuing our regional training and outreach at
GPN and member REN conferences
Proposed
GP-Engine Sites
12. Contributions to NRP @ MU
Grant Scott & Derek Anderson, non-NSF resources
Four - 4x A100 Servers
• 16x A100 GPUs
• 4x 1TB System RAM
• 4x 128 CPU Cores
• 110,592 CUDA Cores
• 1.2TB GPU RAM
• Online Feb 3, 2023
gpn-fiona-mizzou-1
gpn-fiona-mizzou-2
gpn-fiona-mizzou-3
gpn-fiona-mizzou-4
Working with Materials Scientists
• Physics constrained neural models, flat optics
development
• Automated characterization of nanoparticle
energetic reactions
Computer Vision for Geospatial & Remote
Sensing Analytics
• Wildfire Mapping from Space
• Drought and Flood Mapping from Space
• Illicit Deforestation Detection
• Indigenous peoples mapping for healthcare
delivery
13. Regional Perspective from NevadaNet
4NRP Workshop, La Jolla CA
RENs & NRPs Ecosystem Panel
February 2023
Scotty Strachan, Ph.D.
Nevada System of Higher Education
Principal Research Engineer
Mountain Systems Scientist
Co-PI, NSF-EPSCoR Track 1 + CC*’s
@ScottySci
sstrachan@nshe.nevada.edu
14. CLASSIC RESEARCH TECHNOLOGY USE & ADOPTION CURVE:
- Why is this the case? [match of immediate need+capability]
- How to make solutions widely useful? [culture shifts imminent]
LAB GROUP
PARETO
CURVE
RESOURCE
UNITS
16. Does it service a wide range of research
workflows?
Is it easy to get access? [onboarding flow]
What is the learning curve to productivity?
[grads]
What is the support model? [facilitation]
Is it a service commitment? [iteration
rate/maint req]
ECOSYSTEM COMPONENTS: VALUE PROPOSITION
17. Nevada example:
one way to overcome
friction is standardizing on
federated access,
(semi)automated
onboarding/provisioning…
…freeing up distributed
research engineers for
more high-touch
project/data facilitation
Coming now: a new
Science/Teaching DMZ
architecture
more UNLV, UNR, DRI
computing & data
infrastructure goes here
National computing & data
infrastructure goes here
18. MANY OF OUR REGIONAL CHALLENGES ARE GEOGRAPHIC
- I think a lot about the future of Wide-Area-Science [trust, community]
- Roles of nrp components in the IoWT [data-to-action, workflows]
- This will be driven by the regional networks [new mission areas]
RESEARCH FUTURES IN THE WEST
19. Georgia Tech / Southern Crossroads (SoX)
Relatively new to the NRP….
SoX is deploying nodes as part of CC* Area 2 regional awards that include several
HBCUs in Alabama and Georgia
FIONA nodes purchased as DTNs or perfsonar test points and are considering
redeploying as nautilus nodes
Goal is to enable research within our community and looking to assist faculty
unfamiliar with the platform
SoX is looking for toolkits, demos, training we can share, invite, host, etc. for our
participants to help take advantage of the NRP
SoX offers connectivity and space for those interested in hosting nodes
Editor's Notes
Jen Leasure w/The Quilt – the national collaboration of regional research and education networks
As I was preparing for this panel, I thinking the first national research platform workshop that took place about 5.5 years ago at Montana State University.
Its exciting to be here today with you to share about how the PRP/NRP has scaled over this time as well as the collaborations that have been sparked due to this platform and to talk about future directions.
Many of you are familiar with this map of our country’s regional research and education networks.
A Number of Quilt member organizations are represented here at the meeting
This map represents the backbone of each of these organizations
What you don’t see are the last mile connections to approximately 1/3 of our country’s higher education institutions.
Amy’s presentation this morning mentioned the Minds We Need vision – connecting every community college, TCU, HBCU, and minority serving institutions to our national R&E fabric which just one step toward the NSF goal for democratizing access to cyberinfrastructure resources.
Our Quilt community feels strongly about this vision as we are positioned to serve as key partners in collaborating with lesser-resourced institutions and connecting them with the fabric of R&E networks as well as exposing and introducing them to the broader cyberinfrastructure ecosystem represented by all of you at this meeting today.
Panelist represent regional networks in various stages of implementing nautilus nodes on campuses with their regional network partners