Geoextent extraction
Extract geospatial and temporal extent from your data files or remote repositories. This tool analyzes files and returns bounding boxes, time ranges, and other metadata.
Extraction options
Extracted extents
Copy extents stores the extracted geometries in your browser for 5 minutes, so you can paste them into a work's contribution form. After that window they are discarded automatically.
Documentation & supported formats (geoextent v0.13.1.dev15+g8fbbe05d6)
Supported file formats
Formats are dynamically loaded from geoextent's features API.
ZIP archives: Upload ZIP files containing multiple data files. The extraction will process all supported files within the archive.
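As a rough illustration of archive handling, the sketch below lists the members of a ZIP file that look processable and merges per-file bounding boxes into one overall box. It is not geoextent's implementation: the extension set and both function names are hypothetical, and boxes are assumed to be (min_lon, min_lat, max_lon, max_lat) tuples.

```python
import io
import zipfile

# Hypothetical subset of extensions; the real list comes from the features API.
SUPPORTED_EXTENSIONS = {".geojson", ".csv", ".tif", ".gpkg"}


def list_supported_members(zip_bytes):
    """Return names of archive members whose extension is in the assumed set."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        return [
            name
            for name in archive.namelist()
            if not name.endswith("/")  # skip directory entries
            and any(name.lower().endswith(ext) for ext in SUPPORTED_EXTENSIONS)
        ]


def merge_bboxes(bboxes):
    """Union of (min_lon, min_lat, max_lon, max_lat) boxes; None if empty."""
    bboxes = list(bboxes)
    if not bboxes:
        return None
    return (
        min(b[0] for b in bboxes),
        min(b[1] for b in bboxes),
        max(b[2] for b in bboxes),
        max(b[3] for b in bboxes),
    )
```

This simple union does not handle boxes that cross the antimeridian; a real implementation would need a rule for that case.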
Supported repository providers
Provider information is dynamically loaded from geoextent's features API.
Wikidata is a free and open knowledge base that provides structured data to Wikipedia and other Wikimedia projects. Geographic extents are extracted via SPARQL queries for coordinate location (P625) and other geographic properties.
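The following sketch shows one way a P625 lookup could work: build a SPARQL query for a single item, then reduce the returned WKT point literals (longitude first) to a bounding box. The query text and function names are illustrative assumptions, not the query geoextent actually sends, and the response is assumed to follow the standard SPARQL JSON results format.

```python
import re

# Illustrative query: coordinate location (P625) of one item.
QUERY_TEMPLATE = """
SELECT ?coord WHERE {{
  wd:{qid} wdt:P625 ?coord .
}}
"""

_NUMBER = r"([-+]?\d*\.?\d+(?:[eE][-+]?\d+)?)"
# Wikidata serialises coordinates as WKT literals such as "Point(13.38 52.52)".
WKT_POINT = re.compile(r"Point\(\s*" + _NUMBER + r"\s+" + _NUMBER + r"\s*\)", re.IGNORECASE)


def build_query(qid):
    """Return the SPARQL text for a Wikidata item ID such as 'Q64'."""
    if not re.fullmatch(r"Q\d+", qid):
        raise ValueError(f"not a Wikidata item ID: {qid!r}")
    return QUERY_TEMPLATE.format(qid=qid)


def bbox_from_sparql_json(result):
    """Compute (min_lon, min_lat, max_lon, max_lat) from ?coord bindings."""
    lons, lats = [], []
    for binding in result["results"]["bindings"]:
        match = WKT_POINT.search(binding["coord"]["value"])
        if match:
            lons.append(float(match.group(1)))
            lats.append(float(match.group(2)))
    if not lons:
        return None
    return (min(lons), min(lats), max(lons), max(lats))
```

An item with a single coordinate yields a degenerate box whose minimum and maximum corners coincide.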
Dryad is a nonprofit curated general-purpose repository that makes research data discoverable, freely reusable, and citable with DOIs. It specializes in data underlying scientific publications and accepts data in any file format from any field of research under Creative Commons Zero waiver.
4TU.ResearchData is a Dutch national data repository for science, engineering, and design. Hosted by the 4TU Federation of Dutch technical universities, it assigns DOIs and provides long-term data archiving.
Figshare is an online open access repository where researchers can preserve and share their research outputs including figures, datasets, images, and videos. It allows researchers to publish files in any format with assigned DOIs and tracks download statistics for altmetrics.
Zenodo is a free and open digital archive built by CERN and OpenAIRE, enabling researchers to share and preserve research output in any size, format and from all fields of research. It assigns persistent DOIs to all submissions and stores data in the CERN Data Center for long-term preservation.
Generic provider for InvenioRDM-based research data repositories. It supports multiple institutional instances that share the same platform and REST API.
PANGAEA is a digital data library and publisher for earth system science, hosted by the Alfred Wegener Institute and MARUM in Germany. It archives and publishes georeferenced data from earth system research with DOI assignment and holds around 375,000 datasets comprising over 13 billion data items.
The Open Science Framework (OSF) is a free and open-source project management tool developed by the Center for Open Science that facilitates open collaboration in scientific research. It enables researchers to manage, store, and share documents, datasets, and other research materials throughout the project lifecycle with version control and integration capabilities.
Dataverse is an open-source web application for sharing, preserving, citing, exploring, and analyzing research data developed at Harvard University's Institute for Quantitative Social Science. The Harvard Dataverse Repository is one of the largest repositories of open research data in the world with thousands of datasets across all disciplines.
GFZ Data Services is a curated research data repository for the geosciences domain, hosted at the GFZ German Research Centre for Geosciences in Potsdam. It has assigned DOIs to geoscientific datasets since 2004 and provides comprehensive consultation by domain scientists and IT specialists following FAIR principles.
RADAR (Research Data Repository) is a cross-disciplinary research data repository operated by FIZ Karlsruhe. It provides DOI assignment and long-term archiving for German research institutions.
NSF Arctic Data Center is the primary repository for NSF-funded Arctic research data. It provides long-term data archiving and supports ISO 19115 metadata with rich geospatial coverage information.
DataONE (Data Observation Network for Earth) federates ~38 member node repositories. This provider extracts pre-computed spatial bounding boxes and temporal ranges from the Coordinating Node Solr API without downloading data files.
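A metadata-only lookup of this kind might look like the sketch below, which builds a Solr query URL and reads the extent from one result document. The endpoint URL and the field names (westBoundCoord, beginDate, and so on) are assumptions about the DataONE index and should be checked against the DataONE API documentation; the function names are hypothetical.

```python
from urllib.parse import urlencode

# Assumed endpoint and field names; verify against the DataONE API docs.
SOLR_ENDPOINT = "https://cn.dataone.org/cn/v2/query/solr/"
BBOX_FIELDS = ("westBoundCoord", "southBoundCoord", "eastBoundCoord", "northBoundCoord")
TIME_FIELDS = ("beginDate", "endDate")


def build_solr_url(identifier):
    """Build a query URL that asks only for the extent fields of one record."""
    params = {
        "q": f'identifier:"{identifier}"',
        "fl": ",".join(BBOX_FIELDS + TIME_FIELDS),
        "wt": "json",
        "rows": "1",
    }
    return SOLR_ENDPOINT + "?" + urlencode(params)


def extent_from_solr_doc(doc):
    """Read bbox and time range from one Solr document (a plain dict)."""
    try:
        bbox = tuple(float(doc[field]) for field in BBOX_FIELDS)
    except (KeyError, TypeError, ValueError):
        bbox = None  # record has no usable spatial coverage
    tbox = tuple(doc.get(field) for field in TIME_FIELDS)
    return {"bbox": bbox, "tbox": tbox if all(tbox) else None}
```

Because only indexed metadata is requested, no data file needs to be downloaded to obtain the extent.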
The Global Biodiversity Information Facility (GBIF) is the world's largest open biodiversity data network, with 2.5B+ occurrence records. This provider supports metadata-only extraction from the Registry API and optional Darwin Core Archive (DwC-A) data download from institutional IPT servers.
Pensoft Publishers is a scholarly publisher based in Sofia, Bulgaria, specializing in biodiversity and environmental science with over 60 peer-reviewed open access journals. All articles are published under Creative Commons licenses and include semantic enrichments and hyperlinks to facilitate data findability and interoperability.
BGR (Federal Institute for Geosciences and Natural Resources) is the German geoscientific research center providing data and advice on geoscience topics. The BGR Geoportal offers access to geological, geophysical, and hydrogeological datasets with metadata following GeoDCAT-AP and INSPIRE standards.
BAW (Bundesanstalt für Wasserbau / Federal Waterways Engineering and Research Institute) Datenrepository provides research data for waterway engineering, including hydrodynamic models, sediment data, and measurement data. Metadata follows ISO 19115/19139 standards via CSW 2.0.2.
MDI-DE (Marine Daten-Infrastruktur Deutschland / Marine Data Infrastructure Germany) is a distributed spatial data infrastructure for German marine and coastal data. The NOKIS catalog provides ISO 19115/19139 metadata via CSW 2.0.2, with data served via WFS endpoints at various GeoServer instances.
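For the catalogues that expose ISO 19115/19139 metadata, the spatial extent usually sits in a gmd:EX_GeographicBoundingBox element. The sketch below reads the first such element from a metadata record using only the standard library. It assumes the record has already been retrieved (for example from a CSW GetRecordById response); the function name is hypothetical.

```python
import xml.etree.ElementTree as ET

# Standard ISO 19139 namespace URIs.
NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
}

# Element names in (west, south, east, north) order.
_BOUNDS = (
    "westBoundLongitude",
    "southBoundLatitude",
    "eastBoundLongitude",
    "northBoundLatitude",
)


def bbox_from_iso19139(xml_text):
    """Return (west, south, east, north) from the first bounding box, or None."""
    root = ET.fromstring(xml_text)
    box = root.find(".//gmd:EX_GeographicBoundingBox", NS)
    if box is None:
        return None
    values = []
    for name in _BOUNDS:
        node = box.find(f"gmd:{name}/gco:Decimal", NS)
        if node is None or node.text is None:
            return None  # incomplete bounding box
        values.append(float(node.text))
    return tuple(values)
```

Records can contain several bounding boxes; this sketch deliberately reads only the first one.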
GDI-DE (Geodateninfrastruktur Deutschland / Spatial Data Infrastructure Germany) is the national spatial data infrastructure catalogue with 771,000+ records, aggregating metadata from German federal, state, and municipal agencies (BKG, DWD, DLR, etc.).
OPARA is the Open Access Repository and Archive for research data of Saxon universities, jointly operated by TU Dresden and TU Bergakademie Freiberg. It offers free archiving for at least ten years and open access publishing of research data with DOI assignment, and runs on the DSpace 7.x platform.
Senckenberg Biodiversity and Climate Research Centre operates a CKAN-based data portal providing access to biodiversity, climate, and geoscience research datasets. It supports both open access and metadata-only restricted datasets with rich taxonomic and temporal coverage metadata.
Generic provider for CKAN (Comprehensive Knowledge Archive Network) instances. CKAN is an open-source data management system used by government agencies, research organisations, and other institutions worldwide to publish and share open data.
Mendeley Data is a free and secure cloud-based data repository by Elsevier where researchers can store, share, and publish research data. It assigns DOIs to all published datasets and supports any file format.
DEIMS-SDR (Dynamic Ecological Information Management System - Site and Dataset Registry) is a metadata registry for long-term ecological research sites and datasets, powered by eLTER. It catalogues environmental research and monitoring facilities globally, with rich geospatial metadata (WKT boundaries, temporal ranges).
NFDI4Earth Knowledge Hub is a Cordra-based digital object repository for Earth System Sciences with 1.3M+ datasets, powered by a SPARQL endpoint. The OneStop4All portal provides search/discovery.
HALO-DB is the web platform of a data retrieval and long-term archiving system for data based on observations of the HALO research aircraft (High Altitude and LOng Range), operated by DLR (German Aerospace Center). It contains ~9,800 datasets from 115+ scientific missions covering atmospheric science, geophysics, and earth observation.
SEANOE (SEA scieNtific Open data Edition) is a marine science data repository operated by Ifremer/SISMER (France). It publishes open-access oceanographic, marine biology, and geoscience datasets with DOI prefix 10.17882.
GeoScienceWorld is a publishing platform hosting geoscience journals from multiple publishers (SEG, GSL, Mineralogical Society, etc.). Articles include GeoRef metadata with geographic coordinates as WKT.
Provider for Open Journal Systems (OJS) journal article landing pages. It detects the OJS generator meta tag and extracts geospatial metadata embedded by the ojsGeo plugin (Dublin Core SpatialCoverage GeoJSON / WKT, ICBM, geo.position). It works across any OJS-hosted journal that has the plugin installed; pages without it are still recognised as OJS but return no spatial extent.
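The detection described above can be sketched with the standard library's HTML parser. The example collects meta tags, checks the generator value, and reads a point from geo.position ("lat;lon") or ICBM ("lat, lon"). The exact meta names emitted by the ojsGeo plugin are assumed here (in particular DC.SpatialCoverage), and the class and function names are hypothetical.

```python
from html.parser import HTMLParser


class MetaCollector(HTMLParser):
    """Collect <meta name=... content=...> pairs, keyed by lower-cased name."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name")
        content = attributes.get("content")
        if name and content is not None:
            self.meta.setdefault(name.lower(), content)  # keep first occurrence


def inspect_ojs_page(html):
    """Report whether the page looks like OJS and which point it declares."""
    collector = MetaCollector()
    collector.feed(html)
    meta = collector.meta
    is_ojs = "open journal systems" in meta.get("generator", "").lower()
    point = None
    if "geo.position" in meta:  # convention: "lat;lon"
        lat, lon = (float(v) for v in meta["geo.position"].split(";"))
        point = (lon, lat)
    elif "icbm" in meta:  # convention: "lat, lon"
        lat, lon = (float(v) for v in meta["icbm"].split(","))
        point = (lon, lat)
    return {
        "is_ojs": is_ojs,
        "point": point,  # (lon, lat) or None
        "spatial_coverage": meta.get("dc.spatialcoverage"),  # raw GeoJSON/WKT text
    }
```

A page without the plugin would still report is_ojs as true, with point and spatial_coverage left as None.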
Provider for Janeway journal article landing pages (https://janeway.systems/). It detects Janeway via the geo+json alternate link emitted by the janeway_geometadata plugin (https://github.com/GeoinformationSystems/janeway_geometadata/), or via the platform's static asset paths combined with the /article/id/{N}/ URL pattern. It extracts JSON-LD spatialCoverage, DC.SpatialCoverage (GeoJSON/WKT), DC.box, ISO 19139, and DC.temporal, and follows the alternate geo+json link to the plugin's canonical export.
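Two of the Janeway signals can be illustrated briefly: finding an alternate link with the GeoJSON media type and matching the /article/id/{N}/ URL pattern. This sketch assumes the plugin emits a link element with rel="alternate" and type="application/geo+json"; the class and function names are hypothetical, and the static-asset check is omitted.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

ARTICLE_PATH = re.compile(r"/article/id/(\d+)/")


class AlternateLinkFinder(HTMLParser):
    """Remember the href of the first alternate link typed as GeoJSON."""

    def __init__(self):
        super().__init__()
        self.geojson_href = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.geojson_href is not None:
            return
        attributes = dict(attrs)
        rel = (attributes.get("rel") or "").lower().split()
        media_type = (attributes.get("type") or "").lower()
        if "alternate" in rel and media_type == "application/geo+json":
            self.geojson_href = attributes.get("href")


def find_geojson_export(page_url, html):
    """Return the absolute URL of the GeoJSON export, or None if absent."""
    finder = AlternateLinkFinder()
    finder.feed(html)
    if finder.geojson_href is None:
        return None
    return urljoin(page_url, finder.geojson_href)


def article_id(page_url):
    """Return N from a /article/id/{N}/ URL, or None if the pattern is absent."""
    match = ARTICLE_PATH.search(page_url)
    return int(match.group(1)) if match else None
```

Resolving the href against the page URL keeps relative links usable when the export is fetched in a second request.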
UKCEH (UK Centre for Ecology & Hydrology) operates the Environmental Information Data Centre (EIDC). It publishes environmental science datasets including water chemistry, land cover, biomass, and atmospheric data with DOI prefix 10.5285.
SpatioTemporal Asset Catalog (STAC) is an OGC Community Standard for describing geospatial information. This provider supports extraction of spatial bounding boxes and temporal intervals from STAC Collections served by any STAC-compliant API.
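A STAC Collection carries its extent in the extent.spatial.bbox and extent.temporal.interval fields, where the first entry of each list describes the overall extent. The sketch below reads both from an already parsed Collection document; the function name and the returned dictionary shape are illustrative choices, not geoextent's output format.

```python
def extent_from_stac_collection(collection):
    """Return the overall bbox and time interval of a STAC Collection dict."""
    extent = collection.get("extent", {})
    bboxes = extent.get("spatial", {}).get("bbox") or []
    intervals = extent.get("temporal", {}).get("interval") or []

    bbox = None
    if bboxes:
        first = bboxes[0]  # the first bbox covers the whole collection
        if len(first) == 6:
            # 3D box: [west, south, min_elev, east, north, max_elev]
            bbox = (first[0], first[1], first[3], first[4])
        else:
            bbox = tuple(first)  # 2D box: [west, south, east, north]

    tbox = None
    if intervals:
        start, end = intervals[0]  # either end may be None (open interval)
        tbox = (start, end)

    return {"bbox": bbox, "tbox": tbox}
```

An open-ended interval, common for collections that are still growing, comes back with None as its end value.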
GitHub is a platform for hosting and collaborating on code and data. This provider downloads geospatial files from public GitHub repositories and extracts their spatial and temporal extent.
GitLab is a platform for hosting and collaborating on code and data. This provider downloads geospatial files from public GitLab repositories on gitlab.com and self-hosted instances, and extracts their spatial and temporal extent.
Forgejo and Gitea are community-driven git hosting platforms. This provider downloads geospatial files from public Forgejo/Gitea repositories (including Codeberg.org) and extracts their spatial and temporal extent.
Software Heritage is a non-profit archive (Inria + UNESCO) of all publicly available source code, assigning persistent identifiers (SWHIDs) to every software artifact. This provider downloads geospatial files from archived repositories and extracts their spatial and temporal extent.
Provider for direct HTTP(S) URLs to GeoTIFF/COG files. It reads raster headers via GDAL /vsicurl/ without downloading the full file. It works best with Cloud Optimized GeoTIFFs (COG) but supports any HTTP-accessible GeoTIFF.
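As an illustration of the header-only approach, the sketch below prefixes a URL with /vsicurl/ and derives a bounding box from a GDAL geotransform and raster size. The helper names are hypothetical, read_remote_bbox needs the GDAL Python bindings (osgeo) and network access, and the resulting box is in the raster's own coordinate reference system, not necessarily WGS 84.

```python
def vsicurl_path(url):
    """Turn an HTTP(S) URL into a GDAL virtual file system path."""
    if not url.lower().startswith(("http://", "https://")):
        raise ValueError(f"not an HTTP(S) URL: {url!r}")
    return "/vsicurl/" + url


def bbox_from_geotransform(geotransform, width, height):
    """Compute (min_x, min_y, max_x, max_y) from the four raster corners."""
    x0, dx, rx, y0, ry, dy = geotransform
    corners = [(col, row) for col in (0, width) for row in (0, height)]
    # GDAL affine model: x = x0 + col*dx + row*rx, y = y0 + col*ry + row*dy
    xs = [x0 + col * dx + row * rx for col, row in corners]
    ys = [y0 + col * ry + row * dy for col, row in corners]
    return (min(xs), min(ys), max(xs), max(ys))


def read_remote_bbox(url):
    """Open a remote raster via /vsicurl/ and return its bbox in the raster CRS."""
    from osgeo import gdal  # imported lazily; requires the GDAL bindings

    dataset = gdal.Open(vsicurl_path(url))
    if dataset is None:
        raise RuntimeError(f"GDAL could not open {url!r}")
    return bbox_from_geotransform(
        dataset.GetGeoTransform(), dataset.RasterXSize, dataset.RasterYSize
    )
```

Using all four corners rather than only the origin and the opposite corner keeps the result correct for rotated rasters.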