Linkr
Home Resources Tools Documentation Blog Demo
FR
  • What is Linkr?
  • Deployment modes
  • Quickstart
  • Local install
  • Your first project
Core concepts WIP
Workspaces WIP
Data warehouse WIP
  • Introduction
  • Mapping projects
  • Overview
  • Target Concepts
  • Mapping Editor
  • Evaluation
  • Export
  • Progress
Projects WIP
IDE and notebooks WIP
Versioning and collaboration WIP
Plugins WIP
Administration WIP
Reference WIP
Documentation Concept mapping Target Concepts

Target Concepts

Concept sets and OHDSI vocabularies: the sources of target concepts available in the editor.

Summary

The Target Concepts tab is where you prepare the set of standard concepts you’ll then use to align your source codes. Two sub-tabs: Concept Sets (concept dictionaries) and OHDSI Vocabularies (ATHENA download for free search).

Why this step?

In the Mapping Editor, the right pane gives you two ways to find a target concept:

  1. Browse a concept set from a reference dictionary (fast, focused).
  2. Free search: lets you search across all OHDSI concepts, but picking the right one is harder given how many exist.

This tab populates both modes.

Concept Sets

What is a concept set?

A concept set groups several standard concepts under a single clinical label. For example: “Total bilirubin”, “Sepsis”, or “Norepinephrine”. In practice, each one points to the matching LOINC, SNOMED or RxNorm concepts — often numerous and technical.

The concept set: a bridge between clinicians and data scientists

A single clinical reality often maps to dozens or hundreds of codes in a standard terminology, depending on measurement method, context, patient position, device used… “Heart rate”, for instance, corresponds to several hundred distinct LOINC codes.

Yet clinicians don’t think in LOINC — they think in “heart rate”. That’s exactly what a concept set does: it groups all the relevant codes under a label that makes sense from the clinical side.

Clinician

Clinical variable

"I need heart rate."

Common reference

Concept Set

Reviewed, versioned

OHDSI-compatible

Data scientist

OMOP concepts

"LOINC 8867-4 with its descendants."

This is the philosophy behind thematic concept set libraries like the INDICATE Data Dictionary (intensive care), or tomorrow the libraries of professional societies by specialty (paediatrics, cardiology, oncology…): a shared, reusable body of definitions.

Items and resolution

A concept set is a list of items. Each item is a concept (with a conceptId, a conceptName, a vocabularyId, a conceptCode, a domainId, a conceptClassId and a standardConcept flag) along with three options:

  • isExcluded — the item is excluded from the list (useful for “all X except Y”).
  • includeDescendants — automatically includes more specific concepts (descendants in the concept hierarchy).
  • includeMapped — includes concepts that point to this item via concept_relationship.

A concept set can therefore be seen as an equation: a handful of base items, modulated by these options. Resolving a concept set means applying these options to obtain the final list of all retained concepts — a single item with includeDescendants can expand into several hundred concepts. This resolved list is what the Mapping Editor uses to propose targets.

Going further

  • INDICATE — What are concept sets? — a hands-on introduction to concept sets, in the context of the INDICATE dictionary.
  • OHDSI — Concept Set Specification — the formal specification of the JSON format used by OHDSI (and Linkr).
  • ATLAS — Concept Sets wiki — the reference documentation for creating and managing concept sets in ATLAS.

Any concept set following the OHDSI format can be imported into Linkr — whether it comes from a thematic dictionary like the INDICATE Data Dictionary or is hand-crafted in ATLAS (OHDSI’s reference tool for building concept sets).

linkr-v2-b1800b.frama.io

Import and manage the OHDSI concept sets used as mapping targets.

CategorySubcategoryConcept nameItemsVersionProvenance
...
...
...
...
...
Clinical observationsNeurological assessment
3-minute Diagnostic Interview for CAM-defined Delirium (3D-CAM)
11.0.0INDICATE Consortium
Clinical observationsNeurological assessment
Numeric Pain Rating Scale
31.0.0INDICATE Consortium
DrugsAntimicrobials
Posaconazole
11.0.1INDICATE Consortium
DrugsAntimicrobials
Rifampicin
11.0.1INDICATE Consortium
DrugsAntimicrobials
Sulfamethoxazole
11.0.1INDICATE Consortium
DrugsAntimicrobials
Teicoplanin
11.0.1INDICATE Consortium
DrugsAntimicrobials
Tetracycline
11.0.1INDICATE Consortium
The table of project-linked concept sets, with the main actions.

Importing a concept set

Click Import a Concept Set. Three sources are offered:

  • Reference — bulk import from a referenced dictionary, for example the INDICATE Data Dictionary: all of the dictionary’s concept sets are imported in one go.
  • URL — a direct link to the JSON file of a single concept set (typically a raw.githubusercontent.com/.../concept_sets/X.json file). Linkr keeps the link, so you can later update the concept set in one click if the source changes.
  • File — a .json in OHDSI format ({ expression: { items: [...] } }).

Inspecting a concept set

The ⓘ icon at the end of a row opens a side panel with four tabs:

  • Description — the long-form description written by the concept set’s authors. When the concept set comes from a dictionary like INDICATE, you’ll typically find the clinical definition, the coding choices made by experts, and guidance for concept mapping.
  • Statistics — record and patient counts, distribution charts.
  • Resolved concepts — the list of concepts retained once the concept set has been resolved (see above).
  • Expression — the raw list of items in the concept set, with their conceptName, conceptId, vocabulary, domain, and three columns Ex / De / Ma (ticked when the item is marked isExcluded / includeDescendants / includeMapped).

Update, delete

  • Targeted update — the refresh icon on a row re-fetches the JSON from the source URL.
  • Update all — dedicated button, processes all concept sets that have a sourceUrl.
  • Delete concept sets — click Edit: checkboxes appear, select the concept sets to delete then click Delete selected.

Filter the table

Concept sets are indexed by category, subcategory, provenance (source organisation) and version. Column headers offer dropdown filters. Handy when the catalog grows past several hundred entries.

OHDSI Vocabularies

This is where you load the full set of standard OMOP concepts, from an ATHENA download.

linkr-v2-b1800b.frama.io

Reference vocabulary database

Select a folder containing the ATHENA vocabulary files (CSV or Parquet). At minimum, the CONCEPT table is required.

The ATHENA reference loaded: OHDSI tables are indexed in DuckDB for free search.

What to download?

On ATHENA, select the vocabularies you need (at minimum SNOMED, LOINC, RxNorm for standard usage) and trigger the bundle generation. Once it’s ready, you download it directly from the site as a ZIP file containing the OHDSI tables:

  • concept — required, the full catalog of concepts used by the Mapping Editor’s search.
  • The other tables shipped by ATHENA (concept_ancestor, concept_relationship, concept_synonym, concept_class, domain, relationship, vocabulary, drug_strength) are not used by the concept-mapping feature. You can keep them in the folder — they’ll be useful later for generating an OMOP ETL — but they aren’t needed here.

Load the folder

Unzip the archive, then in Linkr click Select folder and pick the ATHENA directory. Linkr identifies the known files automatically and indexes them in DuckDB.

Browse the vocabulary

Once the reference is loaded, you can browse the standard concepts directly from this sub-tab (search, filters by vocabulary / domain / class, “Standard only” toggle). But more importantly, the Mapping Editor unlocks its “Search” mode in the right pane — that’s where the reference truly pays off.

Local storage

The ATHENA reference is stored in IndexedDB like any other database. It can weigh several hundred MB depending on the vocabularies you choose. Once loaded, it is available to all mapping projects in the workspace — you only have to do it once.

Save space: convert CSVs to Parquet

Browser storage is limited (IndexedDB quota). The CSV files shipped by ATHENA are large and redundant. To save space, it’s recommended to convert the files to Parquet before loading them into Linkr — Parquet is compressed and much more compact (often 5 to 10 times), while remaining directly readable by DuckDB.

PreviousOverviewNextMapping Editor

On this page

Product

  • Home
  • Demo

Resources

  • Documentation
  • Resources
  • Tools
  • Blog

Community

  • Framagit source code
  • Github source code

About

  • InterHop.org
  • Contact

2021–2026 InterHop — CC BY-NC-SA 4.0 (site) · GPLv3 (software)