Categories
Featured Product

Lung Function Assessment in Patients with Persistent Asthma: Impact of Disease Severity and Acute Exacerbation Status © 

By Lawrence Rasouliyan, Amanda G. Althoff, and Danae A. Black | OMNY Health 

Understanding the Role of Lung Function in Asthma Management 

Lung function monitoring is important in the management of asthma, and it provides valuable information to clinicians on disease control and patient response. 

Pulmonary function tests are common tests conducted in clinical settings, yet they are infrequently documented as measurements in free-text notes or tabulated sources of real-world data. 

Meanwhile, disease severity and acute exacerbations are usually available based on the diagnosis codes such as the ICD-10.  

However, the association between coded severity and measured indices of lung function has not been well characterized in routine clinical care data on a large scale. 

This gap was filled by our team at OMNY Health, which analyzed the relationship of lung function measures with severity and exacerbation status in subjects with persistent asthma. 

Study Overview 

Using electronic health record (EHR) data from 2017 to 2024, we analyzed information from three integrated delivery networks included in the OMNY Health real-world data platform.  

Patients were included if they had an ICD-10 code for persistent asthma classified as mild, moderate, or severe—either with or without an acute exacerbation. The relevant ICD-10 codes used for classification are shown below. 

To be included, patients also needed at least one documented lung function measurement—specifically, forced expiratory volume in one second (FEV₁) percent predicted (pp), forced vital capacity (FVC) pp, or FEV₁/FVC pp—associated with an asthma-related encounter. 

Key Findings 

Out of approximately one million patients identified with an asthma ICD-10 code indicating severity and exacerbation status, 14,003 patients (across 31,463 encounters) had corresponding lung function data available. 

Across all severities, lung function metrics declined with increasing asthma severity, and patients experiencing exacerbations consistently had lower lung function compared to those without exacerbations.

What the Data Suggests 

Findings have shown a considerable decrease in mean lung function values because asthma severity increased—regardless of exacerbation status.  

Patients experiencing exacerbations had consistently lower FEV₁, FVC, and FEV₁/FVC metrics as compared to the ones with no exacerbations.  

Most interestingly, when we compared ICD-10–coded severity to typical clinical cutoffs for lung function, the correspondence was not strong enough.  

This undoubtedly suggests that ICD-10 coding alone may not fully capture physiological severity, emphasizing the importance of integrating structured and unstructured lung function data into real-world datasets. 

Why It Matters 

By leveraging structured EHR data, this study highlights the potential to better understand asthma progression and treatment outcomes across real-world populations. 

The results reinforce the value of using lung function metrics—not just diagnosis codes—to assess disease burden and guide more precise asthma management strategies. 

References 

  1. Levy ML, et al. NPJ Prim Care Respir Med. 2023;33(1):7. 
  1. Firoozi F, et al. Thorax. 2007;62(7):581–7. 
  1. Gronkiewicz C, et al. Chest. 2015;147(4):1152–1160. 
  1. Xie F, et al. JMIR AI. 2025;4:e69132. 

 

© 2025 OMNY Health  

Categories
Product

Development and Validation of an N-gram Model to Differentiate Between Melena and Hematochezia Using Unstructured EHR Notes © 

By Vikas Kumar, Yan Wang, and Lawrence Rasouliyan | OMNY Health 

Extracting Meaningful Insights from Free-Text EHR Data 

Extracting meaningful information from unstructured data is never easy, It’s often a time-consuming task. In structured EHR data for instance, certain values such as diagnosis codes are often insufficient to capture the full context. The same is mostly true for gastrointestinal bleeding, where a slight inaccuracy might cause significant implications for both diagnosis and treatment. 

One such challenge which our team at OMNY Health tried to solve is the differentiation between two forms of GI bleeding: melena, black tarry stools, usually caused by upper GI bleeding, and hematochezia, bright red blood in stool, usually from lower GI bleeding. Although both are captured under general ICD-10 codes, namely K92.1 and K92.2, respectively, these do not make a distinction between the two. Our objective was to bridge this gap using unstructured data and machine learning. 

Building the Model: From Clinical Notes to Meaningful Insights 

We conducted a retrospective observational study using the OMNY Health Real-World Data Platform (2017–2025). Patients with ICD-10 codes starting with “K” (gastrointestinal diseases) or “E” (endocrine diseases) were included. A clinical domain expert reviewed 1,000 random clinical notes from patients with GI bleed–related codes (K92.1 or K92.2) to identify phrases that indicated either melena or hematochezia. 

Through this manual review, our team identified 28 phrases for melena and 51 phrases for hematochezia, which were then used to build two separate N-gram models. These models searched millions of notes across the dataset to identify encounters associated with each condition. 
 
These models are validated against real-world clinical outcomes to ensure reliability. We compared the rates of upper versus lower GI diagnoses, endoscopic procedures (EGD vs. colonoscopy), and pharmacologic treatments within 30 days following each encounter. 

Results: Real-World Validation That Reflects Clinical Reality 

Our validation showed that the N-gram models accurately differentiated between the two GI bleeding types. 

  • Precision: 96% for melena; 98% for hematochezia 
  • Recall: 7.9% for melena; 5.3% for hematochezia 

TABLE 1 — Samples of Phrases Used for Melena and Hematochezia N-gram 

(Note: samples shown; complete list of phrases used to train each model is available in OMNY Health’s internal dataset.) 

Patients identified with melena were more likely to have an upper GI diagnosis and to undergo esophagogastroduodenoscopy (EGD). Conversely, patients with hematochezia were more likely to have lower GI diagnoses and receive colonoscopy procedures. These results aligned closely with clinical expectations, reinforcing the accuracy and validity of our models. 

Figure 1. Validation Outcomes for Melena and Hematochezia N-gram Models 

Why It Matters: A Step Toward Richer Real-World Evidence 

The ability to differentiate between melena and hematochezia in unstructured EHR data proves to be more beneficial for more granular, clinically meaningful insights. This allows researchers and healthcare organizations to:

  • Better characterize patient populations
  • Refine outcome measures for GI bleeding studies
  • Support drug safety and effectiveness research with higher precision

OMNY Health platform helps researchers to unlock the full potential of real-world clinical information, i.e. turning free-text notes into actionable insights, hence improving care delivery and research quality. 

Looking Ahead 

The study demonstrates how natural language processing (NLP) can be effective in bridging gaps in structured EHR data. As we continue validating these models, our primary focus remains on empowering researchers, clinicians, and life science partners with trustworthy, real-world data solutions. 

© 2025 OMNY Health  

Categories
Product

Characterizing Latent Tuberculosis and Screening Among Individuals Diagnosed with Active Tuberculosis Disease in the United States © 

By Danae A. Black, Amanda Mummert, Amanda G. Althoff, and Lawrence Rasouliyan | OMNY Health 

Tuberculosis (TB) has been a continuous public health challenge in the United States. Even though there are numerous screening and treatment options available, the burden of active TB disease still rises. Almost 80% of TB cases reactivate latent TB infection (LTBI). Timely identification of at-risk individuals and understanding their screening patterns is primarily important to reduce disparities in care.  

Building the Study: From EHR Data to Insights 

Our research team at OMNY Health conducted a retrospective, observational study using the OMNY Health Real-World Data Platform (2020–2024), which integrates electronic health records (EHRs) from multiple U.S. health systems. The goal was to describe individuals diagnosed with respiratory TB and evaluate patterns of TB screening and latent TB diagnosis before the onset of active TB disease. 

Study Design and Methods 

OMNY Health dataset identified the patients with respiratory TB (ICD-10-CM: A15). The earliest date of respiratory TB diagnosis was considered the index date. Demographic characteristics and social determinants of health (SDoH) were summarized at the index date or during the pre-index period. 

Utilization of TB screening procedures (CPT: 86480, 86481, 86580; ICD-10-CM: Z11.1) or diagnosis of latent TB (ICD-10-CM: Z22.7, Z86.15) was evaluated during the pre-index period. Descriptive statistics were reported for all variables of interest. 

Results: Identifying Patterns in Screening and Latent TB 

Between the years 2020-2024, total 6,538 cases of respiratory TB were diagnosed. Among these cases, 238 had a latent TB diagnosis code before the index date and 20% showed evidence of TB screening prior to diagnosis. 

TB testing was significantly higher among females and younger people, whereas it was lower among nonwhite and Hispanic groups. There had been more latent TB recorded among Hispanic individuals. This is consistent with high-risk profiles often seen among overseas-born populations or the ones travelling to counties where it is common. 

Approximately 5% of the population had data available on social health determinants, which revealed certain transportation and education barriers impeding prevention measures or treatment adherence. 

Figure 1. TB Prevention Pathway 

Figure 2. Study Population Demographics Characteristics, by TB Status 

Figure 3. Percentage of Affirmative Responses Across Social Determinants of Health Domains, by TB Status 

Why It Matters: Addressing Gaps in TB Prevention 

The study emphasizes early detection and screening to better prevent TB reactivation. The differences identified in screening rates indicate that demographic and social factors play a vital role to prevent TB. More targeted interventions can be developed to reduce inequities and improve outcomes if proper identification of populations (with limited access to screening and care) is done. 

Looking Ahead 

OMNY Health leverages real-world EHR data to better understand patient journeys enhancing preventive care. Future research is aimed to expand the SDoH integration into predictive modeling and public health decision-making. This will be helpful to bridge the gap between data and actionable outcomes. 

References 

  1. Centers for Disease Control and Prevention. National Data: Reported Tuberculosis in the United States, 2023. Reported Tuberculosis in the United States, 2023. 2024 Nov 7. Accessed February 10, 2025. https://www.cdc.gov/tb-surveillance-report-2023/summary/national.html 
  1. US Preventive Services Task Force. Screening for Latent Tuberculosis Infection in Adults: US Preventive Services Task Force Recommendation Statement. JAMA. 2023;329(17):1487–1494. doi:10.1001/jama.2023.4899C. 

 

© 2025 OMNY Health 

Categories
Product

Unlocking the Hidden Details: Why Unstructured Clinical Notes Are Crucial for Life Sciences 

In the world of life sciences, data is king. For decades, the focus has largely been on structured data, including neatly organized tables, registries with predefined fields, clinical trial results captured in clinical report forms (CRFs), and insurance claims data. While undeniably valuable, this structured data often tells only part of the story. 

The real goldmine, often overlooked and underutilized, lies within unstructured clinical text. These free-text narratives – physician notes, discharge summaries, pathology reports, and radiology findings, to name a few types – contain a wealth of detailed, nuanced, and patient-specific information that rows and columns simply cannot capture fully. 

For life sciences companies, understanding and extracting insights from this unstructured data is no longer a luxury, but a necessity.

Here’s why: 

The Limitations of Structured Data 

Imagine trying to understand a complex patient journey solely from a checklist. Structured data, by its very nature, simplifies and categorizes. It’s excellent for tracking demographics, diagnosis codes, medication lists, and lab results. However, it often misses: 

  • Nuance and Context: The why behind a diagnosis, the specific symptoms a patient described, or the subtle changes in their condition over time.  A coded diagnosis of “headache” doesn’t reveal if it’s a throbbing migraine, a dull ache, or accompanied by visual disturbances.  In fact, most initial visit notes contain a History of Present Illness (HPI) that document the seven cardinal features of the patient’s reason for the encounter: 
    • Onset: When did the symptoms start? (The beginning time/date). 
    • Location: Where on the body is the symptom? Does it radiate or travel anywhere else? 
    • Duration: How long does the symptom last when it occurs? (e.g., seconds, hours, constant). 
    • Character (or Quality): What does it feel like or look like? (e.g., sharp, dull, throbbing, crushing, burning). 
    • Aggravating/Alleviating Factors: What makes it better or worse? (e.g., movement, rest, food, medication). 
    • Radiation (or Related Symptoms): Does the symptom move to another part of the body (Radiation)? Or are there any other symptoms that occur with the primary one (Associated Symptoms)? 
    • Timing (or Temporal characteristics): When does it occur? (e.g., constantly, intermittently, only in the morning, with exertion). 
    • Severity (or Scale): How bad is the symptom? (Usually rated on a scale, such as 1-10 for pain). 
  • Patient History Beyond Codes: Family history details, lifestyle factors, or environmental exposures that might not fit into a predefined field. 
    • Family History: Particularly important for oncology and cardiovascular disease, involves identifying which nuclear/extended family members had related conditions. 
    • Lifestyle Factors: Smoking, alcohol usage, illicit drug use, living situation, social determinants of health, and sexual health are known to be important risk factors but often omitted from structured data. 
    • Occupational / Environmental Exposures: Extra information that sheds light on risk factors for diseases including cancer and asthma.   
  • Treatment Rationale and Adjustments: Why a particular treatment was chosen / switched to / switched from / discontinued, how a patient responded to it, and subsequent modifications. 
  • Rare Disease Insights: For conditions with limited structured data such as their own ICD-10 codes, the narrative of clinical notes becomes even more critical. 

Four Applications of Unstructured Data 

Now that we have established how notes can be delivered to researchers, how exactly can they be used to enhance clinical knowledge?   

After de-identification, the possibilities are endless.   Below we describe four common applications. 

  • Extraction of Disease Severity: As opposed to the clinical trial world in which key outcomes are dutifully and regularly recorded, in the real-world researchers are reliant on physician record-keeping to identify the waxing and waning of disease progression.  And often, these outcomes, also known as severity measures, are found in free-text notes.  Learn more about how researchers at OMNY Health have been extracting information about disease severity from Notes since 2022 with the use of transformer-based pipelines and more recently with large language models (LLMs).   
  • Identifying Reasons for Treatment Discontinuation: Rollouts of newly developed drugs cost pharmaceutical companies millions of dollars; with that amount of investment, it becomes imperative to know more about why new drugs are being discontinued by patients/physicians, and which drugs are taking their place.  At OMNY Health, we have built various pipelines for extracting this information from clinical notes, again using both transformer-based methods and LLMs.   
  • Researching Rare Diseases: For rare diseases, clinical note repositories can be particularly useful in pooling large numbers of patients having such diseases and establishing basic clinical knowledge about them – e.g. What patients are at risk?  Why do some patients experience flares?  What treatments work best? An example is work that OMNY Health completed in partnership with a life sciences company on generalized pustular psoriasis (GPP).   Notes can also be used to find patients exhibiting symptoms or characteristics that might be consistent with undiagnosed rare diseases.  Clinical Notes can help identify patients that are candidates for genetic tests that could potentially validate a rare disease diagnosis. 
  • Training AI Models: In the age of Generative AI and LLMs, it is becoming more important than ever to find reputable sources of healthcare data (read: not the Internet) with which to train healthcare-specific LLMs that can reason without harmful biases.  Need a proven source of de-identified clinical notes from diverse populations and provider mixes with which to train your LLM?  We have made it possible.   

Learn More about OMNY Notes – Contact Us! 

At OMNY Health, we would love to discuss how our OMNY Notes product combined with our structured data offerings can support your clinical research initiatives and ultimately improve health outcomes.  Please contact us at info@omnyhealth.com.  We look forward to hearing from you! 

Categories
Product

Powering Research with LLMs and OMNY Health’s De-identified Clinical Notes in Cystic Fibrosis 

OMNY Health Notes when used in tandem with publicly accessible large language models can generate novel insights that previously were only accessible with labor intensive chart reviews.   

Large language models (LLMs) have advanced dramatically since they were first introduced to the mainstream public a few years ago.   When combined with large, nationally representative de-identified data sets like the OMNY Notes product they can deliver insights in just a few minutes that previously would have required months of effort, and also at a fraction of the cost.  Check out this brief demonstration on how today’s LLMs and OMNY Notes can be used together to enhance clinical research, using notes for patients treated with cystic fibrosis as a sample use case. 

Thank you for watching our demoPlease contact us using the email address/link provided in the video to discover how your research team can harness novel insights using OMNY Notes. 

Categories
Product

Unlocking the Full Story: The Power of Clinical Notes in Real-World Data 

The year is 2025, or more than fifteen years since the enaction of The HITECH Act and Meaningful Use.  Almost all of the clinical data recorded from ordinary Americans’ physician office visits and hospital stays have now shifted to electronic format.  Therefore, increasing emphasis is gradually being placed on the value of real-world data, with the hope that medical knowledge resulting in care improvements can be extracted from the vast amount of information that exists in electronic health records.

Structured vs. Unstructured Clinical Data 

This electronic clinical data can be subdivided into two categories – structured and unstructured.  Examples of structured data include demographic information, diagnoses and procedures (in the form of clinical codes), medication prescription information, insurance records, and vital signs, while examples of unstructured data include free-text clinical narratives and imaging and test reports.

Both types of clinical information are important and perform complementary functions in real-world data.  Structured data contains many basic data elements and is traditionally easier to process, due to its tabular nature.  However, unstructured data has been estimated to comprise 80% of clinical data by volume and often provides insights that are absent from structured clinical data and claims data [1].  There is an old saying among medical professionals that “90% of diagnoses can be made using the patient history, and 10% using the physical exam [2]” (notably, both elements are virtually absent from structured EHR data).

What are some of the details captured in the well-written clinical note that are typically excluded from structured EHR data?

Information Extraction from Unstructured Data in an LLM-World 

A well-written clinical note contains many details about the patient that are absent from both structured tabular data and claims data.  Until just a couple of years ago, the challenge was extracting information from a clinical note into a usable format.  However, with the advent of large language models (LLMs), one can present a note as context and ask a favorite LLM questions about the note, such as “Where is the location of this patient’s pain?” or “Why did the patient discontinue lisinopril?”  Adaptation of this method enables extraction of information from the note as structured categorical data, which can then be used as structured data.

OMNY Notes: A First-of-its-Kind Clinical Notes Data Product  

OMNY Notes is one of our exciting new data products that makes billions of de-identified clinical notes from diverse health systems available to the end-user.   Researchers no longer must rely solely on structured EHR and claims data; they can now view the full patient journey with our HIPAA-compliant de-identified linked structured EHR, claims and notes solutions representing more than 75M individuals. No other solution available today provides the combined depth, breadth, and scale of OMNY structured and unstructured data to support improving quality, safety, and efficiency of healthcare delivery and overall public health.

Contact us at info@omnyhealth.com to learn more about our OMNY Notes product and our other data products:

  • OMNY Foundation
  • OMNY Linked Claims
  • OMNY MedTech

References:

[1] https://healthtechmagazine.net/article/2023/05/structured-vs-unstructured-data-in-healthcare-perfcon.

[2] Tsukamoto, Tomoko, et al. “The contribution of the medical history for the diagnosis of simulated cases by medical students.” International Journal of Medical Education 3 (2012).

Categories
Product

Cognitive Impairment in Alzheimer’s Disease: Patient Characteristics and Treatments in the Real-World Setting

OMNY Health’s recent study is shedding light on the role of cognitive impairment in patients diagnosed with Alzheimer’s disease.   Alzheimer’s disease (AD) is the most frequent type of neurodegenerative disease that develops over several years. It is characterized by multiple cognitive deficits that progress over time, including memory deterioration. Newly approved monoclonal antibodies, unlike traditional medications that primarily relieve symptoms, have been shown to slow the progression of Alzheimer’s disease by approximately 30%.

At OMNY Health, researchers recently completed a retrospective analysis (2017 to 2024) of electronic health records from over 150,000 patients that had received Alzheimer’s disease care in the United States.  The study aimed to characterize the cognitive state of patients and to describe their treatment regimens.  OMNY’s researchers focused on nearly 7,000 Alzheimer’s disease patients in the dataset who had either the Montreal Cognitive Assessment (MoCA) or the Mini-Mental State Exam (MMSE) scores available.  

Key Findings  

The findings provide insight into the cognitive state of patients as they are first observed within the health system and receiving a diagnosis of Alzheimer’s disease.   On average patients were 78 years of age at the time of their first observed diagnosis.  Following the first observed diagnosis of Alzheimer’s, patients were categorized into four groups based on their first reported cognitive scores: normal, mild, moderate, and severe.  

The distribution of cognitive severity among patients was as follows: 

  • Normal: 15%  
  • Mild: 37%  
  • Moderate: 35%  
  • Severe: 13% 

OMNY’s research also found that with increasing cognitive impairment (normal, mild, moderate, severe), there was a monotonic rise in the proportions of female patients (51%, 60%, 64%, 67%) and nonwhite patients (15%, 17%, 22%, 27%). 

The study additionally assessed treatment patterns within 30 days of diagnosis. Many patients (59%) received cholinesterase inhibitors or NMDA receptor antagonists—therapies aimed at symptom management—with usage rates consistent across impairment levels. Fewer than 1% of patients had received treatment with newer monoclonal antibodies.

Why It Matters  

As disease-modifying therapies become more accessible, OMNY’s insights can support the research community in evaluating treatment effectiveness, informing clinical decision-making, and advancing understanding of disease progression across patient populations.

Categories
Product

Mining the Clinical Note: Extracting FEV1/FVC with LLMs for Scalable RWE

An OMNY Health Study on Severity Measure Extraction in Respiratory Care

 

Understanding disease severity is essential to supporting treatment decisions, yet many critical severity metrics—especially in respiratory conditions—are often buried in unstructured EHR notes. At ISPOR 2025, OMNY Health presented findings on how large language models (LLMs) can accurately extract FEV1/FVC scores, a core pulmonary function test (PFT) metric, directly from free-text clinical notes.

Why It Matters 

Clinical measures like FEV1/FVC are pivotal in evaluating lung function and diagnosing COPD and asthma. However, these measures are not consistently captured in structured EHR fields, making them difficult to access at scale. OMNY Health’s study explored whether retrieval-augmented LLMs could help close that gap—automating severity extraction while preserving accuracy. 

Study Overview

OMNY Health researchers sampled 50 random note excerpts containing the phrase “FEV1/FVC” from the OMNY Health platform and categorized them as:  

  • Simple (S): One FEV1/FVC score present 
  • No Value (NV): No score provided 
  • Complex (C): Multiple scores present 

OMNY Health researchers tested two Gemini LLMs (Flash and Pro) using a structured prompt and evaluated:

  • Accuracy in correct extraction 
  • Hallucination rate (false outputs when no score existed) 
  • Latency (processing time in slot milliseconds) 

Key Findings 

1. High Accuracy in Simple Notes 
Flash extracted FEV1/FVC scores with 90% accuracy, outperforming Pro (73.3%).

2. No Hallucinations in NV Notes  
Neither model generated phantom data when no score was present.

3. Pro Recognized Complex Context Better 
While Flash returned one of the multiple scores, Pro acknowledged the complexity—useful for nuanced documentation.

4. Flash Was Faster 
Flash processed data using 6,210 slot milliseconds versus Pro’s 23,576—offering speed advantages for scale. 

Example Outputs

Performance Snapshot

What’s Next 

This study highlights how LLMs can reliably extract clinical severity measures from real-world EHR data. Moving forward, the team plans to evaluate performance across more complex note structures, investigate how interpretability and robustness vary by model, and assess the trade-offs between cost, speed, and depth of output. The long-term vision is to expand this approach to other therapeutic areas where structured fields fall short—enabling deeper, more scalable evidence generation. 

Categories
Product

OMNY Health’s Real-World Data is Redefining Clinical Trial Control Arms

The pharmaceutical industry is rapidly embracing real-world evidence (RWE) to accelerate drug development and regulatory approvals. External control arms (ECAs), built using real-world data (RWD), are transforming clinical trials by offering an alternative to traditional placebo groups. By reducing recruitment burdens, increasing statistical power, and aligning with regulatory expectations, ECAs have the potential to redefine late-stage drug development.

OMNY Health is at the forefront of this shift, providing research-ready EHR datasets that seamlessly integrate into clinical trial designs. Our recent study demonstrates how EHR-derived control arms can effectively mirror traditional placebo groups, offering a robust and scalable solution for regulatory submissions.

Building an External Control Arm with OMNY Health’s RWD

To evaluate the feasibility of an ECA in a late-stage clinical trial, OMNY Health leveraged six specialty dermatology networks and six integrated delivery networks (2017-2024) from its real-world data platform. The study constructed an ECA for the Phase 3 POETYK PSO-1 trial, which evaluated deucravacitinib versus placebo for moderate-to-severe plaque psoriasis.

The ECA was built using precise patient selection criteria that ensured alignment with the trial’s placebo group. Patients were included if they met physician global assessment (PGA) score eligibility, ensuring comparable disease severity. Baseline treatment history was carefully controlled, and topical medication use was restricted prior to the index severity visit to reflect treatment-naïve status. By tracking patients longitudinally, we assessed week-16 outcomes using structured EHR data, mirroring the trial’s methodology.

While some demographic differences were observed—ECA patients were older, more likely to be female, and had a different racial distribution compared to the placebo arm—the disease severity and baseline clinical characteristics were well matched.

Key Findings: OMNY Health’s ECA vs. Trial Placebo Arm

The real-world ECA demonstrated a significantly higher response rate compared to the traditional placebo arm, reinforcing its validity as a comparator.

  • 18.5% of ECA patients achieved PGA 0/1 (clear or almost clear skin), versus only 7.2% in the placebo arm.
  • Despite differences in age, gender, and race, disease burden and severity scores aligned closely between the ECA and placebo group.
  • Stratification by prior biologic use, systemic therapy history, and weight showed no notable impact on outcomes.

One notable finding was racial disparities in achieving the primary endpoint within the ECA, suggesting that differences in patient demographics between the ECA and placebo group may have contributed to the observed differences in response rates. Further research could explore adjustments such as population weighting to refine ECA comparability even further.

These results confirm that OMNY Health’s real-world dataset can be used to generate ECAs that replicate clinical trial placebo groups, while also revealing new insights into patient diversity, treatment history, and long-term outcomes.

Why OMNY Health’s Data is a Game-Changer for External Control Arms

Traditional clinical trials face high recruitment costs, ethical concerns over placebo use, and long enrollment timelines. OMNY Health helps to eliminate these barriers by offering regulatory-grade EHR data that aligns with clinical trial endpoints.

With 85M+ patients, 1B+ encounters, and 4B+ unstructured clinical notes, our dataset provides a scalable and statistically powerful alternative to traditional control groups. By incorporating structured disease severity scores, prescribing patterns, and physician-reported outcomes, our ECAs offer more efficient and cost-effective alternative to prospectively collected placebo data.

Beyond reducing recruitment time, OMNY Health’s real-world ECAs improve trial generalizability, capturing diverse patient populations often underrepresented in traditional studies. As the FDA increasingly endorses RWD for regulatory decision-making, the ability to integrate ECA’s into pivotal trials is becoming a competitive advantage for pharmaceutical companies.

Categories
Product

From Symptom to Strategy: What Itch Intensity Data Tells Us About Pruritus Management

Pruritus, or chronic itch, is one of the most common symptoms reported in dermatology, yet real-world treatment patterns remain poorly characterized. While topical therapies are typically the first-line approach, severe cases often require systemic interventions. But how do clinicians determine when to escalate therapy? OMNY Health conducted a real-world evidence study leveraging research-ready EHR data from six specialty dermatology networks to examine how itch intensity, a critical but underutilized measure, influences real-world prescribing patterns. By integrating structured and unstructured clinical data, OMNY provides new insights into the relationship between symptom severity and treatment decisions.

Quantifying Pruritus Severity in Real-World Data

While pruritus is a symptom, its severity can significantly impact quality of life and treatment choices. This study focused on patients with a documented 10-point itch intensity score, captured through physician assessments and patient-reported outcomes. The dataset included 7,330 patients with 8,115 encounters, capturing the following structured severity measures:

  • Itch intensity scores (0-10 scale) recorded alongside pruritus-related encounters.
  • Pruritus-related prescriptions categorized into topical and systemic treatments.
  • Demographic characteristics, including age, gender, and race, to understand patient stratification.
  • Treatment patterns by disease severity, assessing whether increasing itch intensity influenced therapy selection.

By leveraging EHR-derived severity assessments, the study provides a more granular understanding of how treatment decisions align with symptom burden.

How Itch Intensity Drives Treatment Decisions

Findings revealed a clear relationship between itch severity and systemic therapy use, while topical therapies were prescribed at consistent rates across all levels of severity.

  • Topical corticosteroids were the most prescribed treatment, used in nearly half of all pruritus-related visits, regardless of itch intensity.
  • Topical calcineurin inhibitors were prescribed far less frequently, at around 10% of cases, with minimal use of alternative topical therapies.
  • Systemic therapy prescriptions increased as itch severity worsened, with sedative antihistamines emerging as the most commonly used option.
  • Prescription rates for sedative antihistamines climbed from 13% in mild cases to 26% in severe cases, while other systemic treatments—including non-sedative antihistamines, systemic doxepin, SSRIs, and opioid receptor antagonists—were prescribed far less frequently.

The increasing use of sedative antihistamines in severe cases suggests a reliance on limited systemic options, leaving a gap in alternative therapies for patients with refractory pruritus.

Why Itch Intensity Matters for Clinical and Research Applications

The underutilization of structured itch severity scores in real-world studies has historically limited the ability to quantify treatment impact. Our findings reinforce why itch intensity should be a standard measure in both clinical decision-making and drug development. For clinicians, understanding real-world prescribing trends tied to symptom severity can optimize treatment pathways and inform escalation decisions. For researchers and trial sponsors, itch intensity scores provide an opportunity to refine patient segmentation, support cohort identification, and assess real-world treatment responses. For drug developers, the limited use of alternative systemic therapies highlights a need for novel treatments, particularly for patients with refractory pruritus.

Advancing Dermatology Research with Real-World Data

This EHR-based study provides new insights into how structured itch intensity measures correlate with real-world prescribing behavior. By integrating structured severity assessments with prescribing data, OMNY Health’s research-ready datasets offer a unique lens into treatment trends, patient stratification, and disease burden.