File formats
This document lists the preferred and accepted file formats that can be used in data publication in the Data Repository.
Preferred formats
Preferred formats are file formats that are accepted and can be curated on the long term. Generally, file formats that are open, free to use, considered standard formats in relevant communities and commonly used are preferred.
Images
Type |
Extensions |
Mimetypes |
JPEG |
jpg, jpeg |
image/jpeg, image/jpg |
JPEG2000 |
jp2, jpeg2 |
image/jpeg2000 |
GIF |
gif |
image/gif |
PNG |
png |
image/png |
TIFF |
tiff, tif |
image/tiff |
Text
Type |
Extensions |
Mimetypes |
Text |
txt, text, md |
application/plain, text/plain, text/markdown |
Documents
Type |
Extensions |
Mimetypes |
Closed |
pdf |
application/pdf |
Open |
odt, ods, odp |
application/vnd.oasis.opendocument.text, application/vnd.oasis.opendocument.spreadsheet, application/vnd.oasis.opendocument.presentation |
Movies
Type |
Extensions |
Mimetypes |
Container |
mp4, avi, mkv, mov, wmv |
video/mp4, application/x-troff-msvideo, video/avi, video/msvideo, video/x-msvideo, video/x-matroska, video/quicktime, video/x-ms-wmv |
Tabular data
Type |
Extensions |
Mimetypes |
HDF |
hdf5, h5, hdf4 |
application/octet-stream, application/x-hdf5, application/x-hdf4 |
NETCDF |
nc, cdl |
application/x-netcdf |
Comma-separated |
csv, txt |
text/csv, text/txt |
GeoTIFF |
geotiff |
application/geotiff |
Accepted formats
Accepted formats are file formats that are accepted during deposition and are bitwise preserved for the duration of publication, but will receive no further curation on the long term. This usually is the case because file formats are not open, not properly described, or are considered obsolete. All preferred formats are accepted, as well as the following additional formats:
Documents
Type |
Extensions |
Mimetypes |
Office |
doc, docx |
application/msword, application/vnd.openxmlformats-officedocument.wordprocessingml.document |
Tabular data
Type |
Extensions |
Mimetypes |
Office |
xls, xlsx |
application/vnd.ms-excel, application/vnd.openxmlformats-officedocument.spreadsheetml.sheet |
Sequencing
Type |
Extensions |
Mimetypes |
DNA sequencing base |
fastq, fq |
chemical/seq-na-fastq |
DNA protein sequencing |
fasta, fa, fna, fsa, mpfa |
chemical/seq-na-fasta, chemical/seq-aa-fasta |
Xml
Type |
Extensions |
Mimetypes |
Mass spectroscopy |
mzxml |
application/xml |
Container
Type |
Extensions |
Mimetypes |
tar |
tar, gz |
application/tar, application/gzip |
Preservation and curation
Our preservation and curation policies can be found in the Preservation policy document.
Your file format not listed?
Possibly you have a dataset consisting of file with a file format not listed here. If you feel your file format should be listed here, please contact us.
Disclaimer
The information shown on this page is subject to change without prior notice. Any removal or addition of file formats does not affect any deposits made before this information was changed.