🔥 fhiry — FHIR to Pandas DataFrame for Data Analytics, AI, and ML

FHIRy is a Python package that simplifies health data analytics and machine learning by converting FHIR bundles or NDJSON files from bulk data export into pandas DataFrames. These DataFrames can be used directly with ML libraries such as TensorFlow and PyTorch. FHIRy also supports FHIR server search and FHIR tables on BigQuery.

✨ Features

Flatten FHIR Bundles/NDJSON to DataFrames for analytics and ML
Import from FHIR Server via FHIR Search API
Query FHIR Data on Google BigQuery
LLM-based Natural Language Queries (see examples/llm_example.py)
Flexible Filtering and Column Selection

🔧 Quick Start

Installation

Stable release:

pip install fhiry

Latest development version:

pip install git+https://github.com/dermatologist/fhiry.git

LLM support:

pip install fhiry[llm]

Usage

1. Import FHIR Bundles (JSON) from Folder

import fhiry.parallel as fp
df = fp.process('/path/to/fhir/resources')
print(df.info())

Example dataset: Synthea Notebook: notebooks/synthea.ipynb

2. Import NDJSON from Folder

import fhiry.parallel as fp
df = fp.ndjson('/path/to/fhir/ndjson/files')
print(df.info())

Example dataset: SMART Bulk Data Server Notebook: notebooks/ndjson.ipynb

3. Import FHIR Search Results

Fetch resources from a FHIR server using the FHIR Search API:

from fhiry.fhirsearch import Fhirsearch

fs = Fhirsearch(fhir_base_url="http://fhir-server:8080/fhir")
params = {"code": "http://snomed.info/sct|39065001"}
df = fs.search(resource_type="Condition", search_parameters=params)
print(df.info())

See fhir-search.md for details.

4. Import from Google BigQuery FHIR Dataset

from fhiry.bqsearch import BQsearch
bqs = BQsearch()
df = bqs.search("SELECT * FROM `bigquery-public-data.fhir_synthea.patient` LIMIT 20")

🚀 5. LLM-based Natural Language Queries

FHIRy supports natural language queries over FHIR bundles/NDJSON using llama-index:

pip install fhiry[llm]

See usage: examples/llm_example.py

🚀 6. Convert FHIR Bundles/Resources to Text for LLMs

Convert a FHIR Bundle or resource to a textual representation for LLMs:

from fhiry import FlattenFhir
import json

bundle = json.load(open('bundle.json'))
flatten_fhir = FlattenFhir(bundle)
print(flatten_fhir.flattened)

Filters and Column Selection

You can pass a config JSON to any constructor to remove or rename columns:

df = fp.process('/path/to/fhir/resources', config_json='{ "REMOVE": ["resource.text.div"], "RENAME": { "resource.id": "id" } }')
fs = Fhirsearch(fhir_base_url="http://fhir-server:8080/fhir", config_json='{ "REMOVE": ["resource.text.div"], "RENAME": { "resource.id": "id" } }')
bqs = BQsearch('{ "REMOVE": ["resource.text.div"], "RENAME": { "resource.id": "id" } }')

See df.columns for available columns. Example columns:

patientId
fullUrl
resource.resourceType
resource.id
resource.name
resource.telecom
resource.gender
...

Command Line Interface (CLI)

See CLI examples:

fhiry --help

Documentation

Full documentation: https://dermatologist.github.io/fhiry/

Contributing

We welcome contributions! See CONTRIBUTING.md.

Give Us a Star ⭐️

If you find this project useful, please give us a star to help others discover it.

Name		Name	Last commit message	Last commit date
Latest commit History 437 Commits
.devcontainer		.devcontainer
.github		.github
.vscode		.vscode
docs		docs
examples		examples
notebooks		notebooks
notes		notes
src		src
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
codecov.yaml		codecov.yaml
fhir-search.md		fhir-search.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
t_install.py		t_install.py
test.sh		test.sh
tox.ini		tox.ini
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🔥 fhiry — FHIR to Pandas DataFrame for Data Analytics, AI, and ML

✨ Features

🔧 Quick Start

Installation

Usage

1. Import FHIR Bundles (JSON) from Folder

2. Import NDJSON from Folder

3. Import FHIR Search Results

4. Import from Google BigQuery FHIR Dataset

🚀 5. LLM-based Natural Language Queries

🚀 6. Convert FHIR Bundles/Resources to Text for LLMs

Filters and Column Selection

Command Line Interface (CLI)

Documentation

Contributing

Give Us a Star ⭐️

Contributors

About

Uh oh!

Releases 17

Packages

Uh oh!

Contributors 7

Uh oh!

Languages

License

dermatologist/fhiry

Folders and files

Latest commit

History

Repository files navigation

🔥 fhiry — FHIR to Pandas DataFrame for Data Analytics, AI, and ML

✨ Features

🔧 Quick Start

Installation

Usage

1. Import FHIR Bundles (JSON) from Folder

2. Import NDJSON from Folder

3. Import FHIR Search Results

4. Import from Google BigQuery FHIR Dataset

🚀 5. LLM-based Natural Language Queries

🚀 6. Convert FHIR Bundles/Resources to Text for LLMs

Filters and Column Selection

Command Line Interface (CLI)

Documentation

Contributing

Give Us a Star ⭐️

Contributors

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 17

Packages 0

Uh oh!

Contributors 7

Uh oh!

Languages

Packages