EUDAT and PRACE are joining forces to help research communities gain access to high quality managed e-Infrastructures whose resources can be connected together to enable cross-utilization use cases and make them accessible without any technical barrier.
The capability to couple data and compute resources together is considered one of the key factors to accelerate scientific innovation and advance research frontiers. This collaboration has already achieved relevant results as five cross-projects, ranging from different scientific fields, have been granted to use approximately 400TB of storage space at different EUDAT centres. These projects, granted during the last PRACE DECI Call 13, are now using EUDAT storage resources to secure the results of their computational experiments and can build on this result to create a long-term repository of valuable data for further elaboration and dissemination.
Thanks to this collaborative effort, the PRACE DECI Call 14 welcomes new joint project proposals requiring to couple PRACE Tier-1 resources with services for long-term data management and preservation. EUDAT offers around 1 PB (1000 TiB) of storage capacity, approximately 150 TiB per project awarded at one or more of its sites distributed across Europe for at least 24 months after the end of the DECI program grant, free of charge.
How will the projects be selected?
EUDAT will categorize projects on the basis of their estimated level of complexity, the amount of requested resource and the long-term strategy.
Proposals will be shortlisted according to the following criteria:
- Relevance of the project with respect to the current EUDAT service roadmap,
- Organizational feasibility and sustainability aspects of the project with regards to the expertise and human resources which are available at EUDAT and the applicant sites,
- Technical feasibility with regard to the availability of storage and network capacities and capabilities which can be provided via EUDAT,
- Expected benefits of the project for the immediate research community,
- Expected benefits for European researchers beyond the immediate research community from which the project originates, and
- Contribution to Open Access.
Data deposited onto EUDAT resources will be available through different interfaces, including GridFTP, and maintained close to PRACE facilities, machines and network, to facilitate further usage and transmission.
Further to the storage resources, EUDAT experts will work to facilitate integration and use of its data services for finding, sharing, storing, replicating, staging and performing computations with primary and secondary research data.
Case 1: The community A, to ensure that the results of its climate simulations are made public and shared with colleagues from other research groups, needs to store 120TB of NetCDF data sets for 36 months on a trustworthy site and make this archive be available through a Thredds/ESGF web service open-access.
Solution 1: EUDAT will assign the project to one of its sites which is considered trustworthy for the community, and able to provide the requested storage capacity and to host the Thredds service instance for the duration of the project and beyond. EUDAT has a dedicated team who takes care of the enabling of data projects and that can ensure that all community requirements are properly interpreted and implemented. Once the resources are secured and the services configuration, the members of the community can start transferring the data from one infrastructure to the other using the well-known GridFTP protocol. Transferred data will automatically end-up into the community archive and appear into Thredds.
Frequently asked questions
Q1: Which kind of data I can deposit onto EUDAT resources?
A1: Any kind of structured or unstructured data are allowed. The EUDAT B2SHARE service is also able to manage metadata records and can be extended to support your own metadata schema.
Q2: How can I transfer data?
A2: EUDAT supports two main transfer protocols, GridFTP and HTTP.
Q3: What’s a Data Management Plan (DMP)?
A3: The DMP is a useful instrument for researchers to reflect on and communicate about the way they will deal with their data. It prompts them to think about how they will generate, analyse and share data during their research project and afterwards. EUDAT, in collaboration with OpenAIRE, offers free training initiatives, including webinar, to help researchers develop a Data Management Plan for their data sets.
Q4: Will my data be secretly mint or manipulated to collect statistics or extract new insights from them?
A4: No, EUDAT ensures integrity and stewardship of all stored data. The EUDAT infrastructure is based on trustworthy partners.
Q5: Can I use my PRACE X.509 credentials to access EUDAT services?
A5: Yes, EUDAT supports X.509 as authentication mechanism and is working with PRACE colleagues to ensure seamless access to services.
Q6: Does EUDAT provide computational power for data postprocessing and further data analysis?
A6: The EUDAT infrastructure includes some of the largest HPC centres in Europe and the provisioning of computing cycles can be negotiated directly with the hosting centre. EUDAT also offers cloud computing resources for hosting community special services which consume deposited data.
Q7: Does EUDAT provide any material, such as documentation, on how its services work?
A7: Yes, the user documentation is available here.
Q8: Who are the communities EUDAT is working with?
A8: Since the beginning the collaboration with research communities has been one of the EUDAT constituent pillars. The list of active collaborations is available here.
Q9: Does EUDAT provide support to communities?
A9: Yes, EUDAT has a dedicated support helpdesk for all its services. In addition, during the enabling phase, a dedicated amount of manpower will be allocated to facilitate services uptake and use. Requests for support can be submitted via this web form.