Journal Landing Page Examples ============================== This page demonstrates how to extract geospatial extent from **journal article landing pages** that advertise spatial metadata in their HTML head. Geoextent ships with a ``journals/`` umbrella content provider that detects three publishing platforms out of the box and pulls a normalised bbox / tbox out of whatever encoding the publisher chose. Overview -------- The ``journals/`` umbrella covers: - **OJS** (Open Journal Systems) with the `ojsGeo `_ plugin — fingerprinted via the ```` tag every OJS theme emits. - **Janeway** with the `janeway_geometadata `_ plugin — fingerprinted via the ```` link the plugin always renders, plus the ``/article/id/{N}/`` URL pattern. - **Pensoft** journals (e.g. Biodiversity Data Journal, ZooKeys) — DOI-prefix fast-path on ``10.3897/`` plus a JSON-LD ``contentLocation`` walker. See :doc:`examples` for the existing Pensoft example. For OJS and Janeway you can pass either: - the article landing-page URL (``http://journal-host/article/view/{N}`` or ``/article/id/{N}/``), or - a DOI that resolves to the article (any prefix — the OJS/Janeway providers do not need a per-publisher allowlist). A journal that advertises the platform but does **not** have the geo plugin installed is still recognised; geoextent then returns no spatial extent but will still surface the publication date and, when ``--ext-metadata`` is set, look up the article DOI on CrossRef / DataCite. Source-Preference Priority -------------------------- When a page emits the same article in multiple encodings, the provider picks the one most likely to carry the richest geometry, *not* the one easiest to parse. Priority order (first match wins): 1. JSON-LD ``