SlideShare a Scribd company logo
Business Intelligence Software For Inquisitive Minds
Typical Scientific Data Workflow 1. Data Acquisition Excel, Access, homebrews (Electronic?!) forms, notes LIMS & instruments output Labmatrix forms & records Other enterprise resources etc… 2. Data Exploration 3.Data Analysis SAS, R Spotfire Tableau Statisticians etc… Easy, graphical queries ETL & data cleaning tools  Formulas & calculations Visualize charts & graphs Exploration Ad-hoc Query Relationships Intelligence Decisions Acquisition Analysis Analysis
Once you have: Collected…  (  ) Standardized…   (Not yet? Use built-in data cleaning tools) Normalized…   (Not yet? Use built-in formula calculation tools) … some, or all of your  project data ,  how do you best make use of them?
Piles of project data from various sources Domain experts with many complex data questions Programmers DB The Problem:  subject matter experts having to go through a (limited) pipeline of IT expertise to answer complex questions about their domain-specific data.  IT DB DB DB DB
IT / Programmers Domain Experts / Researchers Can’t access data by myself My data inquiries are taking too long to process I have many more inquiries but afraid to ask IT misinterprets my inquiries Changed my mind about inquiries in process already Data result doesn’t look right Didn’t IT know I need to relate A with B in this specific way? … Too many throw-away or one-off project requests They keep changing their minds about how to cut the data Nothing is standardized No prioritization: using brute force approach to grind through all data instead of critical path Could use more domain expertise when processing piles of complex data … DNA! Biomarkers! Transcription! Primary key! Data type! Object model! Clashing of Expertise
IT / Programmers DB The Solution:  Domain Experts Common workspace Shared “language” All raw & prepared data can be centralized here. The data processes and data queries are shown graphically, so they are easily understood by both IT and domain experts. DB DB DB DB centralize prepare data query data
IT / Programmers Domain Experts / Researchers Can explore data by myself Get results from complex questions in minutes instead of weeks Gain actionable insights even from rough or messy data (within institutional guidelines) Visually share interesting data queries with colleagues Visually share data workflows and issues with IT personnel Help IT identify data issues and prioritize fixes … Centralized environment to prepare and present data sets Built-in import, data cleaning, standardization & ontology tools Centrally manage data access and audit all changes and activities Prepare and fix data issues with guided priority from end-users Develop & reuse code for projects via programmatic interface Self-serve model allows IT to work on other things … Symbiotic Expertise
Symbiotic Expertise =  smarter & less IT efforts, faster & better data access for domain experts With the ability to explore data easily, domain experts can quickly identify  relevant  data, gain actionable insights, and better drive efforts SEA OF DATA
Meds Patients Step 1.  Drag & drop a set of data on top of another. How does   work? Patients on Meds Meds Step 2.  Data sets are intelligently and automatically connected to each other. Patients Filter Step 3.  Expand the scope and detail of your question with additional data sets, filter conditions, calculations, or other kinds of transformations as necessary. Each “node” is live, so you can retrieve and review the results from each step as you build a complex query.  Result Set 1 Result Set 2 Combine Filter Pivot You are now trained in using Qiagram.
Current Client Application Areas: Clinical & Translational Research Biomarker Discovery Healthcare Data Utilization/Consumption In silico  Clinical Trial Feasibility Consortium Collaborations Cheminformatics Research …
Cryptic DB you’ll never  have easy access to Case Study: Common Problem in Translational Research
The Solution Qiagram : our award-winning “draw-your-question” interface - SQL or programming training NOT required! Just drag & drop, and run your query!
Qiagram: a visual data query tool Example 1: “reporting & operational statistics” data query
Qiagram: a visual data query tool Example 2: hypothesis-driven data exploration
Qiagram: a better BI tool for translational research (TR) ... the exploratory & discovery nature of TR requires tools specifically designed for TR endeavors, instead of shoe-horning traditional BI technologies. Traditional BI  TR Informatics Budget $$$ $ Purpose Operational Exploratory  Questions Simple Complex Data Cleaning & Standardization Precursor to meaningful queries Parallel to meaningful queries Data Sources Well understood Ever-changing Data organization Hierarchical Ad hoc Perspective Static Individualized Collaboration Limited Extensive
An enterprise, scalable solution that communicates with  all  data sources SOAP tab-delimited text DB SOAP RMI HTTP DB Large Flat  Files DB Web Forms,  Data Files XML Java Objects .TXT Federation Engine DB DB Many ways to get data into the system: Qiagram Framework ETL Framework Data  Transformer Qiagram Core API Custom Web  Services Enterprise System RMI API Enterprise System WEB UI SQL Scripts
Centralize Data:  web-accessible  system enables immediate data staging, multi-site collaboration, data/site management, data QA/review/reports, and instant data querying results; scalable enterprise deployment Clean & Standardize:  improve data quality via built-in data cleaning and standardization tools; establish or import vocabularies & standardized data models Enforce User Roles & Permissions:  flexible configurations of how different users/groups/TAs can access specific data sets in collaborative settings Maintain Security & Compliance:  transmit data securely, facilitate regulatory compliance, and track all data changes via detailed audit logs in this HIPAA/PHI-compliant system; customizable data backup & recovery plans Integration & Interoperability:  multiple interfaces to communicate with other data systems in your IT infrastructure; vocabulary & ontology definitions KEY FRAMEWORK FEATURES
Proprietary & Confidential Qiagram: accolades

More Related Content

Qiagram

  • 1. Business Intelligence Software For Inquisitive Minds
  • 2. Typical Scientific Data Workflow 1. Data Acquisition Excel, Access, homebrews (Electronic?!) forms, notes LIMS & instruments output Labmatrix forms & records Other enterprise resources etc… 2. Data Exploration 3.Data Analysis SAS, R Spotfire Tableau Statisticians etc… Easy, graphical queries ETL & data cleaning tools Formulas & calculations Visualize charts & graphs Exploration Ad-hoc Query Relationships Intelligence Decisions Acquisition Analysis Analysis
  • 3. Once you have: Collected… (  ) Standardized… (Not yet? Use built-in data cleaning tools) Normalized… (Not yet? Use built-in formula calculation tools) … some, or all of your project data , how do you best make use of them?
  • 4. Piles of project data from various sources Domain experts with many complex data questions Programmers DB The Problem: subject matter experts having to go through a (limited) pipeline of IT expertise to answer complex questions about their domain-specific data. IT DB DB DB DB
  • 5. IT / Programmers Domain Experts / Researchers Can’t access data by myself My data inquiries are taking too long to process I have many more inquiries but afraid to ask IT misinterprets my inquiries Changed my mind about inquiries in process already Data result doesn’t look right Didn’t IT know I need to relate A with B in this specific way? … Too many throw-away or one-off project requests They keep changing their minds about how to cut the data Nothing is standardized No prioritization: using brute force approach to grind through all data instead of critical path Could use more domain expertise when processing piles of complex data … DNA! Biomarkers! Transcription! Primary key! Data type! Object model! Clashing of Expertise
  • 6. IT / Programmers DB The Solution: Domain Experts Common workspace Shared “language” All raw & prepared data can be centralized here. The data processes and data queries are shown graphically, so they are easily understood by both IT and domain experts. DB DB DB DB centralize prepare data query data
  • 7. IT / Programmers Domain Experts / Researchers Can explore data by myself Get results from complex questions in minutes instead of weeks Gain actionable insights even from rough or messy data (within institutional guidelines) Visually share interesting data queries with colleagues Visually share data workflows and issues with IT personnel Help IT identify data issues and prioritize fixes … Centralized environment to prepare and present data sets Built-in import, data cleaning, standardization & ontology tools Centrally manage data access and audit all changes and activities Prepare and fix data issues with guided priority from end-users Develop & reuse code for projects via programmatic interface Self-serve model allows IT to work on other things … Symbiotic Expertise
  • 8. Symbiotic Expertise = smarter & less IT efforts, faster & better data access for domain experts With the ability to explore data easily, domain experts can quickly identify relevant data, gain actionable insights, and better drive efforts SEA OF DATA
  • 9. Meds Patients Step 1. Drag & drop a set of data on top of another. How does work? Patients on Meds Meds Step 2. Data sets are intelligently and automatically connected to each other. Patients Filter Step 3. Expand the scope and detail of your question with additional data sets, filter conditions, calculations, or other kinds of transformations as necessary. Each “node” is live, so you can retrieve and review the results from each step as you build a complex query. Result Set 1 Result Set 2 Combine Filter Pivot You are now trained in using Qiagram.
  • 10. Current Client Application Areas: Clinical & Translational Research Biomarker Discovery Healthcare Data Utilization/Consumption In silico Clinical Trial Feasibility Consortium Collaborations Cheminformatics Research …
  • 11. Cryptic DB you’ll never have easy access to Case Study: Common Problem in Translational Research
  • 12. The Solution Qiagram : our award-winning “draw-your-question” interface - SQL or programming training NOT required! Just drag & drop, and run your query!
  • 13. Qiagram: a visual data query tool Example 1: “reporting & operational statistics” data query
  • 14. Qiagram: a visual data query tool Example 2: hypothesis-driven data exploration
  • 15. Qiagram: a better BI tool for translational research (TR) ... the exploratory & discovery nature of TR requires tools specifically designed for TR endeavors, instead of shoe-horning traditional BI technologies. Traditional BI TR Informatics Budget $$$ $ Purpose Operational Exploratory Questions Simple Complex Data Cleaning & Standardization Precursor to meaningful queries Parallel to meaningful queries Data Sources Well understood Ever-changing Data organization Hierarchical Ad hoc Perspective Static Individualized Collaboration Limited Extensive
  • 16. An enterprise, scalable solution that communicates with all data sources SOAP tab-delimited text DB SOAP RMI HTTP DB Large Flat Files DB Web Forms, Data Files XML Java Objects .TXT Federation Engine DB DB Many ways to get data into the system: Qiagram Framework ETL Framework Data Transformer Qiagram Core API Custom Web Services Enterprise System RMI API Enterprise System WEB UI SQL Scripts
  • 17. Centralize Data: web-accessible system enables immediate data staging, multi-site collaboration, data/site management, data QA/review/reports, and instant data querying results; scalable enterprise deployment Clean & Standardize: improve data quality via built-in data cleaning and standardization tools; establish or import vocabularies & standardized data models Enforce User Roles & Permissions: flexible configurations of how different users/groups/TAs can access specific data sets in collaborative settings Maintain Security & Compliance: transmit data securely, facilitate regulatory compliance, and track all data changes via detailed audit logs in this HIPAA/PHI-compliant system; customizable data backup & recovery plans Integration & Interoperability: multiple interfaces to communicate with other data systems in your IT infrastructure; vocabulary & ontology definitions KEY FRAMEWORK FEATURES
  • 18. Proprietary & Confidential Qiagram: accolades

Editor's Notes

  1. Data is the lifeblood of the scientific process. Data is used to prove or disprove hypotheses and to guide decision-making. Its lifespan starts with data generation and ends with achieving concrete insights. There is a step in the middle, however, that is often overlooked: data exploration - the stage of hypothesis generation and decision-making during which the researcher takes a high-level, exploratory view of the available data. Data acquisition --> Data exploration --> Data analysis This data exploration stage is often used to determine the availability of "interesting data" that can be used in further analysis.
  2. By removing the barrier of specialized database languages, we enable researchers to directly investigate their data by simply drawing a diagram. This frees up IT to better focus on infrastructure and support operations.