Climate Policy Radar aggregates document collections of laws and policies, litigation cases, submissions to UN multilateral environmental agreements, and other core document sets from expert data providers like UN agencies, research institutes, and multilateral organisations – to make them queryable and usable for a range of global users from parliamentarians to researchers to litigators.
We currently work with approximately fifteen data providers and plan to triple that number over the next two years as we grow into new domains (e.g. nature, development) and types of evidence (e.g. subnational laws and policies, international bilateral agreements).
We are looking for an experienced individual to own how external document collections are structured, integrated and maintained with CPR’s systems as we scale.
This is a core strategic role within the Programmes team, working cross-functionally across the organisation. At its heart, the role sits at the intersection of document collection curation, organisation, and aggregation. You will define and govern CPR’s document ingestion processes and metadata schema requirements, working closely with Product and Engineering on implementation, and with subject matter experts and Partnerships and Operations on provider onboarding and relationship management.
You will ensure that document collections are integrated into CPR’s systems in a way that is structured, consistent, scalable, and usable for search and analysis.
What You’ll Do
Lead the aggregation of external document collections into CPR’s systems, ensuring they are structured, consistent and usable.
Define, apply and maintain metadata standards, schema requirements, taxonomies, and controlled vocabularies, translating organisational and product needs into clear requirements for Engineering implementation.
Evaluate and onboard new sources and datasets. Work alongside the partnerships team to support highly-respected external document collection curators to add documents datasets of laws, policies, litigation cases, climate finance projects, UN submissions and reports to our database.
Anticipate and manage schema evolution as external providers update or expand their data (for example, adding new fields or changing formats), ensuring CPR systems adapt smoothly.
Create and carry out data quality processes, including identifying duplication, improving metadata completeness, and maintaining consistency across collections.
Ensure that content gaps raised by user feedback or analysis feed back into collection priorities and schema development.
Document processes and standards so workflows are repeatable and scalable.
Track and communicate the impact of data ingestion efforts, including metrics on database coverage, data quality and update frequency.