# TRE sustainability and operations ## Overview ### Summary Sustainability needs to be long term, but how do you plan for it when the scenario may change in 5 years? There is also an issue with research, this is a service yet funding requires teams to appear to be doing something new each time, and funders often prefer not to pay for infrastructure (also challenges with cost estimates and under/over expenditures). There are several variables and questions about whether they should be free at point of use (distributing against overheads), or whether to employ a membership user model, a project fee model, standard features being free but charging for high demanding ones or something else. In all cases at least some core funding is required to ensure continuity, specialisation and quality. What we want to ensure is that a public service exists. ### Next Steps - Create a roadmap that focuses on: - Technical skillsets - Information governance requirements - 10 year funding plan ## Raw notes Sustainability from funding perspective beyond the initial 5 years - But what are things going to look like in 5 years time CL centrally funded model - Service in place, refreshed but need to appear to do something different each time to secure funding. **Why different?** - How costing then? Free at point of use, cost distributed against overheads. - Constrain in the cloud? Barts recover work space costs from research projects, distributed central cost on a membership/license/user model - Difference between model for internal and external users. Standard provision free, high storage/compute needs to be recovered - More paperwork to create and chase invoices. no funders like paying for infrastructure What counts as core if it was funded? - Duties imposed as data controllers law, or interpretation runs counter to wants of researchers Folk specialising, if it doesn't get funded for the future that capability is lost. Regional SDE model might lead the way of costing-funding-recovery Some central funding Specialist areas - operational team - Different environments work differently from researcher perspective Sustain people Business and operations to use OS TRE safely and securely what is the perfect TRE/SDE environment future consolidation Software development can be amortised across the community SERP tenant Training component Who provides desk-side support Tracking usage, egress process, layers of tools and processes that need to be in place In/out nature of TRE, tiered sensitivity? Commercial sensitivity. Has auditability in the TRE, does it need to be? - Why different for UCL TRE? Difference in TRE makes funding case easier, adding something new made it more interesting. Using research funding to backfill Estimate in advance what project is likely to use, operational costs, usually completely wrong and go over project - Not sustainable to go consistently over budget - Bill after usage is best, but challenging for proposal/funding Cliff edge, have funding but only sufficient for 1 year not 3 years of project. Following Access to HPC model What can you take off the board if problem is solved strategically - Good training for Data scientists: SC like training relevant to disciplines Seems like we're trying to boil the ocean - VDI, Excel may be R, Stata - Developing things to deal with core use case Core capabilities, exceptional stuff is great, but majority, early stage users, standardise and simplify. Whatever it is, what's missing the ability to understand data. GIGO Standardisation of data makes it seem simpler than it is, reproducibility? AI/ML store data for XX years, is it readable in that time? Who picks up the storage costs for the data. Guidance How can we make it more transparent Constrained with the current model. Guidance provided by RCs, institutional risk as the org have underwritten the project. This breakout room continued during the second round Concerned about being able to provide a service, don't control budgets - Sustainability of providing a public service, rather than generating a business case SNSDE comes under DH budgets, makes things easier HDRUK MRC led 20 year vision 5 year cycle - UKBB core underpinning funding - Fund TREs for 3-5 years for specific projects - Specific use cases not currently supported - Individual researchers and work with them and the RO. - Free at the point of use funding? - Provide underpinning capacity? What is ONS Model? - Free at point of access - Don't know how the budget is secured - Funding comes through different sources ADR UK - Research proposal, existing staff funding or contracted. - For commercial and public researchers usage has to be for public good, commit to publishing and not for profit - Virtual machines provided some policy for standardising storage/compute available - Trying to enable research Driven by what researchers ask for - Intrinsic limit on budget call - Budget for a specific network/platform - Leverage external investment - Some Pharma match funding - Universities also fund Move to long term funding - Strategic level of funding, buffered from long-term budget - Hub large funding but cliff-edged Free at the point of use - Incentivised-disinsentivised, equity of access - Power users can over-consume, less accountability not having to justify use consuming data token publication and harvesting data for private use - Free at point of access so data is freely accessible - Reminder: Don't offer data for commercial use Challenges: - Ingress-egress labour intensive to pour human eyes - Automation tools for validating statistical disclosure test - Skilled job - Tools and more people-more efficient tools; more people would always be good. - All TREs have these issues, share the solutions More automation -IDS (Integrated Data Service- SRS Secure Research Service - Free at point of use?? Cuts out some of the applications automated validation of inputs Understand the whole pathway - Fix one part and it just shows the next bottleneck - Fraunhoffer 1/3-1/3-1/3 lights_on-academic-commercial_activity - Sustainability, prime an initiative without committing to long term investment More people - more monkeys on typewriters Over focus on the medical use case currently, needs to rebalance. Better understanding and economy of scale from small numbers. - Focus critical mass on small number - DARE UK would create a TRE to handle data as an offering What is a TRE? - At what point does a federated TRE network become a single TRE? - TT: At the point at which you have seamless transition between TREs? Trust that the analysis/code is running as intended? ### Roadmap plan #### Questions - What would a solution to this problem look like? - What resources would be needed (people, time, funds, infrastructure etc.)? - How can this community support you in getting them? - What working groups/orgs are already working on this, if any? How can we collaborate with them effectively? #### Roadmap: A roadmap should address - Technical knowledge, skills, TRE staff skillsets - Why doing this has to be part of retaining people - Localising staff makes this easier, central models push more to thinking about pay - To address retention - Pipeline of talent - Can TRE model work in R - Not just technical, IG, where can I get more information - Consultancy - Embedded technical/operational/IG knowledge relevant to the problem. - Research - teaching balance. - Funding - Lots of politics, in HPC communities, good for those who get it. Not good for those who have to resort to begging - Not necessarily good for SDE - Analysis will follow data - People with data will need to bolt compute - HPC allocation modelled SDE account for compute/storage costs - Why should SDE and HPC be considered differently 10 year plan - scope for accreditation - Chartered research infrastructure? - CSP platform neutral certifications for Data/Cloud Infrastructure sustainability People: - Infrastructure/Developers - Operations - Data Scientists