A Bioinformatics Core Matures: Growing and maintaining customer base while approaching cost-recovery James Cavalcoli, PhD University of Michigan Ann Arbor Outline • Cost recovery • Expanding services • Identifying new customers • Retention of Customers and Staff/Faculty UM Bioinformatics Core Bioinformatics: The science of analyzing and interpreting complex biological data The Bioinformatics Core was created in CCMB (ca. 2009) and established as a Biomedical Research Core Facility by the Office of Research in 2012 Our mission is to meet the growing bioinformatics analysis needs of researchers We help interpret complex, high-throughput biological data (DNA, RNA, Protein) 3 Cost recovery • In general, Cores at UM are supposed to be selfsupporting through recharges back to researchers • Bioinformatics Core’s major costs are salaries – software, cluster usage – Hardware - servers Cost recovery - hurdles • Developing methods and pipelines without a specific paying project. • Accurately tracking time on custom projects • Being able to charge enough to cover costs without breaking investigators banks. Recharge 35% Methods Develop ment 65% Cost recovery – cont’d • Service charges need to include the cost of developing the pipelines / automation for repeatable tasks • Need to recover the time for “learning” a new method, not just the time to run the method • Accurate estimation of time needed is difficult to obtain • Project management effort too ! 6 4/22/2015 Current Trend in Growth Growth of Core Utilization 120 • Revenue was $221,558. Goal was 265K – pretty close !! • # Projects and Investigators are on the right trajectory • Changes in revenue from recharge rate restructuring and from increase in projects too 100 80 60 40 20 0 Schools Departments Investigators Projects 2011 3 12 13 16 2012 5 27 20 25 2013 4 24 33 55 2014 4 26 66 109 Designing Services • Identifying repeatable tasks – A process/method that many will use • Quantitating time / effort into a per sample cost – Resistance and questioning of “hourly” charges – Incorporating effort to develop the repeatable pipeline • Estimating time/effort for novel / custom projects – Custom projects have differential risk, and are assigned to bins of size/risk/effort Expanding Services • New methods come out (Tophat, STAR, etc) • New Technology / platform (PacBio, minIon?) • New utilization / request from researchers (gene panels, Hi-C) ** Need to validate new methods to insure proper results are given and reproducible. When to expand? • When is a new service warranted?? – The genomics core is adding a platform – Multiple requests for the same type of analysis • When do you turn a service into a pipeline? – Need to improve the time to deliver, reduce internal effort – Large increase in requests and samples How to Expand Customer Base? • Roadshows !! – Present at departmental faculty meetings – Describe the core, services – Give examples of projects we’ve worked on • Recruit / Identify External Customers – Other colleges and universities – Biotech / Commercial • Create a regional resource ? We help you throughout your research lifecycle Project Planning Experiment Design Grant Support (Letters) Sequencing Methods Education and Training Data Requirements Execution Data Analysis Options Validation Support Methods/Softwa re Development Post-review Follow-up Analysis Annotation, Integration, Visualization Results Interpretation 12 Publication Data Analysis Sample Size Roadshow example slides Our Standard Services 13 Genome Assembly Identify disease associated variations in DNA Analyze gene expression and alternative splicing Detect regulatory elements and chromatin structure Explore genomes of unsequenced organisms Types DNA Methylation and Protein Binding Whole Genome Seq (WGS) Exome Seq (WES) Amplicon RNA-Seq Microarray ChIP-Seq MeDIP-Seq RRBS Methods RNA Expression Alignment, Variant identification Alignment, Transcript assembly Differential expression Alignment, Peak/signal detection De-novo assembly, Homology mapping, gene prediction Results Genome Re-sequencing Nucleotide polymorphisms, Copy number and structural variations Differential gene expression, Alternative Splicing, Novel transcripts Transcription factor binding, Histone modification, Methylation state Draft genome, Contigs, Functional annotation De-novo sequencing Metagenomics Custom Support for your Research When you need something beyond an established support service … Come talk with us ! Research Support • Develop new methods (amplicon seq; pangenome assembly) • Implement novel analysis workflows (RNA-DNA Editing ) • Evaluate and integrate third-party tools and data sources (Test a new aligners or variant callers) • Specific analysis for publication figures New algorithms and software • Data integration and visualization (e.g. Jacquard, Winnow) • Database design and implementation (DB for RNA-seq metadata and results; Kretzler) 14 Genomic Drivers in Leukemia and Lymphoma (Sami Malek) 2012 - 2014 : 6 projects (4 Exome-Seq, 1 Gene Panel,1 Custom), 152 samples Functional Impact of novel coding variants (SNPS, InDels) Sample Pos ID Somatic Variant Detection Sample 1 chr11 Exome-Seq FL, CLL Ref/Alt A /G Var Effect Type SNP Sample 1 chr11 C/T SNP Sample 2 chr10 -/T IINS Sample 2 chr10 TTT/- DEL Class STOP GAINED AA NONSENSE E104* NON SYNONYMOU MISSENSE S CODING NON SYNONYMOU MISSENSE S CODING NON SYNONYMOU MISSENSE S CODING GENE Gene A N3173D Gene A D361H Gene B E306K Gene B • Novel coding variants identified • Validated with Sanger sequencing Gene-wise listing of variants across 60 samples with functional impact Targeted Gene Panel Prevalence of targeted variants in patient populations 10 genes 60 FL samples Custom Analysis Publication Support 15 Analysis Request • How well do Exome SNP calls match up to SNP6.0 array calls? Method Result • Exome Sensitivity/Specificity analysis against SNP6.0 • Exome SNP calls had 90% sensitivity and 99% specificity Cross-platform Comparison (Exome vs. Microarray) How to maintain growth and quality? • Core service offerings need to managed effectively to insure: – – – – Uniformity of service (cost, time) Quality and reproducibility Resource allocation Customer satisfaction Project & Portfolio Management Customer Retention • • • • • Personal service Informed data handoff; Quality of data Clarity of deliverables and costs Responsiveness to (reasonable) requests Discussing the science before and after the experiment and analysis (Collaboration!) 17 4/22/2015 Staff/faculty Retention • Avoiding boredom and burnout – New Challenges; continuous learning • Building teamwork and cross-training. – Maintain a ‘research’ experience where the core can drive the science • Competing with Industry salaries? – University vs. Commercial – tough to do – Make sure the environment is so good they won’t leave! Bioinformatics Core Team Analysis Team Rich McEachin Ana Grant Manjusha Pande Development Team Weisheng Wu Rajasree Menon Outreach & Training Chris Gates Pete Ulintz Marci Brandenburg Jessica Bene Operations Management Jim Cavalcoli Director Ashwini Bhasi 19 Summary • Cost Recovery improved by good time tracking, appropriate rate structures; increasing throughput • Improving and expanding services is a constant challenge • Outreach is essential for new customer ID • Collaboration and listening to existing customers improves retention • Keeping your staff/faculty engaged is important to longterm consistency Thank you ! The UM Bioinformatics Core is supported by the Office of Research, Medical School Administration and the Biomedical Research Core Facilities at the University of Michigan School of Medicine. 21 4/22/2015
© Copyright 2024