Categories
Featured Product

Development and Validation of an N-gram Model to Differentiate Between Melena and Hematochezia Using Unstructured EHR Notes © 

By Vikas Kumar, Yan Wang, and Lawrence Rasouliyan | OMNY Health 

Extracting Meaningful Insights from Free-Text EHR Data 

Extracting meaningful information from unstructured data is never easy, It’s often a time-consuming task. In structured EHR data for instance, certain values such as diagnosis codes are often insufficient to capture the full context. The same is mostly true for gastrointestinal bleeding, where a slight inaccuracy might cause significant implications for both diagnosis and treatment. 

One such challenge which our team at OMNY Health tried to solve is the differentiation between two forms of GI bleeding: melena, black tarry stools, usually caused by upper GI bleeding, and hematochezia, bright red blood in stool, usually from lower GI bleeding. Although both are captured under general ICD-10 codes, namely K92.1 and K92.2, respectively, these do not make a distinction between the two. Our objective was to bridge this gap using unstructured data and machine learning. 

Building the Model: From Clinical Notes to Meaningful Insights 

We conducted a retrospective observational study using the OMNY Health Real-World Data Platform (2017–2025). Patients with ICD-10 codes starting with “K” (gastrointestinal diseases) or “E” (endocrine diseases) were included. A clinical domain expert reviewed 1,000 random clinical notes from patients with GI bleed–related codes (K92.1 or K92.2) to identify phrases that indicated either melena or hematochezia. 

Through this manual review, our team identified 28 phrases for melena and 51 phrases for hematochezia, which were then used to build two separate N-gram models. These models searched millions of notes across the dataset to identify encounters associated with each condition. 
 
These models are validated against real-world clinical outcomes to ensure reliability. We compared the rates of upper versus lower GI diagnoses, endoscopic procedures (EGD vs. colonoscopy), and pharmacologic treatments within 30 days following each encounter. 

Results: Real-World Validation That Reflects Clinical Reality 

Our validation showed that the N-gram models accurately differentiated between the two GI bleeding types. 

  • Precision: 96% for melena; 98% for hematochezia 
  • Recall: 7.9% for melena; 5.3% for hematochezia 

TABLE 1 — Samples of Phrases Used for Melena and Hematochezia N-gram 

(Note: samples shown; complete list of phrases used to train each model is available in OMNY Health’s internal dataset.) 

Patients identified with melena were more likely to have an upper GI diagnosis and to undergo esophagogastroduodenoscopy (EGD). Conversely, patients with hematochezia were more likely to have lower GI diagnoses and receive colonoscopy procedures. These results aligned closely with clinical expectations, reinforcing the accuracy and validity of our models. 

Figure 1. Validation Outcomes for Melena and Hematochezia N-gram Models 

Why It Matters: A Step Toward Richer Real-World Evidence 

The ability to differentiate between melena and hematochezia in unstructured EHR data proves to be more beneficial for more granular, clinically meaningful insights. This allows researchers and healthcare organizations to:

  • Better characterize patient populations
  • Refine outcome measures for GI bleeding studies
  • Support drug safety and effectiveness research with higher precision

OMNY Health platform helps researchers to unlock the full potential of real-world clinical information, i.e. turning free-text notes into actionable insights, hence improving care delivery and research quality. 

Looking Ahead 

The study demonstrates how natural language processing (NLP) can be effective in bridging gaps in structured EHR data. As we continue validating these models, our primary focus remains on empowering researchers, clinicians, and life science partners with trustworthy, real-world data solutions. 

© 2025 OMNY Health  

Categories
Featured News

OMNY Health Wins Fierce Healthcare Innovation Award for Clinical Information Management 

We are thrilled to announce that OMNY Health has been named a winner in the prestigious Fierce Healthcare Innovation Awards for our groundbreaking work in Clinical Information Management! 

This recognition from Fierce Healthcare, a leading voice at the intersection of healthcare, business, and policy, celebrates the organizations that are driving improvements and truly transforming the industry. 

OMNY Health was honored specifically for our innovative work on Unstructured Clinical Notes, demonstrating our unwavering commitment to unlocking the full potential of real-world data to advance research, power AI, and ultimately, improve patient care. 

OMNY’s Eric Lavin, SVP Commercial, and Dr. Mitesh Rao, CEO at the Fierce Innovation Awards in New York City

Unlocking the Hidden Context of Care 

The majority of critical patient context is often buried within unstructured data inside the Electronic Health Record (EHR). This context includes the nuances of a diagnosis, the challenges of a treatment plan, and the social factors affecting a patient’s journey, such as physician notes, discharge summaries, and clinical reports. This information is notoriously difficult to access, integrate, and utilize at scale. 

Our ground-breaking solution tackles this challenge head-on. OMNY Health’s platform transforms billions of these complex, siloed, unstructured clinical notes into high-quality, regulatory-grade, and AI-ready datasets. By linking this deep, contextual information with structured clinical data, we provide a holistic, longitudinal view of the patient experience. 

This capability is essential for: 

  • Fueling Responsible AI: Providing clean, unbiased, and comprehensive datasets to train next-generation clinical and operational AI models. 
  • Accelerating Research: Giving life sciences and healthcare researchers the necessary context to understand treatment efficacy, patient subpopulations, and complex disease progression. 
  • Driving Health Equity: Ensuring that data used for research and development is truly representative of the national population, including diverse geographies, ethnicities, and care settings. 

A Mission Confirmed 

This award is a powerful validation of our democratic approach to healthcare data, which focuses on partnering directly with provider organizations to create a nationally representative “living data layer.” 

“To be recognized by Fierce Healthcare for our work in Clinical Information Management is a tremendous honor and underscores the criticality of solving the unstructured data problem,” said Mitesh Rao, M.D., CEO and co-founder of OMNY Health. “The true voice of the patient and the context of their care often reside in those notes. By making this data accessible and usable, we are giving researchers, developers, and health systems the fuel they need to deliver on the promise of precision medicine and responsible AI.” 

The Future of Real-World Data 

At OMNY Health, we believe that clean, usable, and representative data is the foundation of a better healthcare future. We are proud to stand among the industry’s most innovative companies and remain committed to expanding our platform to help our partners accelerate discovery and improve outcomes for millions of patients across the nation. 

 

Learn more about the awards and our category win on the official Fierce Healthcare Innovation Awards page. 

Discover how OMNY Health is transforming real-world data for your organization at omnyhealth.com. 

Categories
Product

Characterizing Latent Tuberculosis and Screening Among Individuals Diagnosed with Active Tuberculosis Disease in the United States © 

By Danae A. Black, Amanda Mummert, Amanda G. Althoff, and Lawrence Rasouliyan | OMNY Health 

Tuberculosis (TB) has been a continuous public health challenge in the United States. Even though there are numerous screening and treatment options available, the burden of active TB disease still rises. Almost 80% of TB cases reactivate latent TB infection (LTBI). Timely identification of at-risk individuals and understanding their screening patterns is primarily important to reduce disparities in care.  

Building the Study: From EHR Data to Insights 

Our research team at OMNY Health conducted a retrospective, observational study using the OMNY Health Real-World Data Platform (2020–2024), which integrates electronic health records (EHRs) from multiple U.S. health systems. The goal was to describe individuals diagnosed with respiratory TB and evaluate patterns of TB screening and latent TB diagnosis before the onset of active TB disease. 

Study Design and Methods 

OMNY Health dataset identified the patients with respiratory TB (ICD-10-CM: A15). The earliest date of respiratory TB diagnosis was considered the index date. Demographic characteristics and social determinants of health (SDoH) were summarized at the index date or during the pre-index period. 

Utilization of TB screening procedures (CPT: 86480, 86481, 86580; ICD-10-CM: Z11.1) or diagnosis of latent TB (ICD-10-CM: Z22.7, Z86.15) was evaluated during the pre-index period. Descriptive statistics were reported for all variables of interest. 

Results: Identifying Patterns in Screening and Latent TB 

Between the years 2020-2024, total 6,538 cases of respiratory TB were diagnosed. Among these cases, 238 had a latent TB diagnosis code before the index date and 20% showed evidence of TB screening prior to diagnosis. 

TB testing was significantly higher among females and younger people, whereas it was lower among nonwhite and Hispanic groups. There had been more latent TB recorded among Hispanic individuals. This is consistent with high-risk profiles often seen among overseas-born populations or the ones travelling to counties where it is common. 

Approximately 5% of the population had data available on social health determinants, which revealed certain transportation and education barriers impeding prevention measures or treatment adherence. 

Figure 1. TB Prevention Pathway 

Figure 2. Study Population Demographics Characteristics, by TB Status 

Figure 3. Percentage of Affirmative Responses Across Social Determinants of Health Domains, by TB Status 

Why It Matters: Addressing Gaps in TB Prevention 

The study emphasizes early detection and screening to better prevent TB reactivation. The differences identified in screening rates indicate that demographic and social factors play a vital role to prevent TB. More targeted interventions can be developed to reduce inequities and improve outcomes if proper identification of populations (with limited access to screening and care) is done. 

Looking Ahead 

OMNY Health leverages real-world EHR data to better understand patient journeys enhancing preventive care. Future research is aimed to expand the SDoH integration into predictive modeling and public health decision-making. This will be helpful to bridge the gap between data and actionable outcomes. 

References 

  1. Centers for Disease Control and Prevention. National Data: Reported Tuberculosis in the United States, 2023. Reported Tuberculosis in the United States, 2023. 2024 Nov 7. Accessed February 10, 2025. https://www.cdc.gov/tb-surveillance-report-2023/summary/national.html 
  1. US Preventive Services Task Force. Screening for Latent Tuberculosis Infection in Adults: US Preventive Services Task Force Recommendation Statement. JAMA. 2023;329(17):1487–1494. doi:10.1001/jama.2023.4899C. 

 

© 2025 OMNY Health 

Categories
News

OMNY Health Chosen to Present at Venture Atlanta 2025, the Southeast’s Premier Tech Conference

Now in its 18th year, Venture Atlanta has helped launch 930 companies, raise more than $8 billion in funding, and achieve $20.8 billion in successful exits to date 

 ATLANTA – September 11th, 2025 – OMNY Health is proud to announce its selection as one of the Southeast’s most promising tech companies to present at Venture Atlanta 2025. The event, to be held October 15-16 at The Woodruff Arts Center and Atlanta Symphony Hall, brings together visionary startups and growth-stage companies with hundreds of the nation’s top-tier investors. As the Southeast’s premier platform for tech innovation, growth, and capital access, Venture Atlanta continues to be a launchpad for companies shaping the future. 

Now in its 18th year, Venture Atlanta has helped launch over 930 companies, facilitating over $8 billion in funding and $20.8 billion in successful exits to date. This year, the event has expanded to include new founder pathways, curated networking, and procurement programming to attract over 1,600 attendees, including 450 investment funds from across the U.S. 

“We are incredibly honored to be selected to present at Venture Atlanta 2025. This city has become a hub for health tech innovation, and Venture Atlanta is the premier platform where the future of the industry gets shaped. To be recognized among a peer group of such promising, high-growth companies is a testament to our team and the critical infrastructure we are building. We look forward to sharing how OMNY’s healthcare data ecosystem will fuel the next generation of life-changing innovation.” 

As in previous years, Venture Atlanta 2025 is anticipated to be a sold-out event. 

“Venture Atlanta is where companies come to get discovered,” said Venture Atlanta CEO Allyson Eman. “To be selected in a year as competitive as this one speaks volumes about the strength and potential of these startups. These companies didn’t just stand out—they’re poised to break out. With hundreds of investors in the room and a highly curated audience, the visibility companies get here often leads directly to the funding, partnerships, and momentum they need to thrive. We’re incredibly proud to help founders gain the exposure and support that accelerates their path to success.” 

This year’s Venture Atlanta will once again bring together the Southeast’s most promising technology companies and the investors eager to discover them. With applications from across the region—including Alabama, Arkansas, Florida, Georgia, Kentucky, Louisiana, Maryland, North Carolina, South Carolina, Mississippi, Tennessee, Texas, Virginia, and Washington, D.C.—the 2025 conference offers a rare opportunity to see the region’s top innovators all in one place. 

Venture Atlanta boasts a roster of highly successful alumni, including Bark, CallRail, Car360, Flock Safety, Florence Healthcare, ParkMobile, Salesloft, Kabbage, Bitcoin Depot, PrizePicks, Stax, SingleOps, Pindrop, Terminus, and many others. 

To learn more about OMNY Health, visit www.omnyhealth.com.  For additional information about Venture Atlanta, to register for the event, or to view the conference schedule, please visit www.ventureatlanta.org.  

 

About Venture Atlanta 

Venture Atlanta, the Southeast’s technology innovation event, is where the region’s most promising tech companies meet the country’s top-tier investors. As the Southeast’s largest investor showcase helping launch 930 companies and raise over $8 billion in funding to date, the event connects the region’s top entrepreneurs with local and national investors and others in the technology ecosystem who can help them raise the capital they need to grow their businesses. The annual nonprofit event is a collaboration of the Atlanta CEO Council, Metro Atlanta Chamber, and the Technology Association of Georgia (TAG). For more information, visit www.ventureatlanta.org. For updates, follow us on Twitter and LinkedIn, and visit our blog

 

About OMNY Health 

OMNY Health™ is the leading healthcare ecosystem for compliant real-world data insights at scale. OMNY Health connects patients, providers, and life sciences companies by transforming vast amounts of de-identified electronic health record data, clinical notes, and claims data into robust, research-ready insights. Leveraging proprietary AI, NLP, and LLM technologies, OMNY Health accelerates therapeutic innovation, optimizes clinical development, and enhances patient care. For more information, visit www.omnyhealth.com

Categories
News

OMNY Health Dataset Hits 100 Million Patient Milestone, Unlocking Unprecedented Access to Healthcare Data for Research

Spanning nearly 30 percent of the U.S. population, the de-identified data available includes insights from unstructured clinical notes, electronic medical records, and medical claims

ATLANTA, GA – July 31, 2025 – OMNY Health, the leading healthcare ecosystem for compliant real-world data (RWD) insights at scale, today announced its data network now officially encompasses insights from more than 100 million patients across all 50 U.S. states. This landmark achievement solidifies OMNY Health’s position as a premier source for comprehensive, HIPAA-compliant de-identified RWD, providing life sciences organizations, healthcare providers, and AI innovators with unparalleled access to standardized, rich, longitudinal patient journeys.

Since onboarding its first provider partner in 2020, OMNY Health has expanded to include more than 46 leading healthcare organizations, including St. Luke’s University Health Network, Bon Secours Mercy Health, and Baptist Health System KY & IN, providing a comprehensive view of patient care. The data on its platform spans more than eight years of historical data, including billions of clinical encounters from more than 650,000 providers as well as 6.5 billion clinical notes, making it the deepest repository of unstructured clinical knowledge in the country. The breadth of the OMNY health platform enables deeper insights into disease progression, treatment patterns, and patient outcomes. Additionally, OMNY Health’s data spans every therapeutic area, with most having comprehensive data from more than 10 million patients each, further supporting its diverse use cases across the healthcare ecosystem.

“Unlocking AI’s transformative power in healthcare demands a new approach for collaboration. It’s not just about volume of data, but more importantly, about the quality, diversity, and real-world representation of the data available to researchers,” said Matthew Fenty, Managing Director for Innovation & Strategic Partnerships at St. Luke’s University Health Network. “From their beginnings, OMNY Health has maintained a keen understanding of this critical need and provides a platform that empowers health systems like ours to securely contribute and collectively build the rich, diverse data foundation essential for cutting-edge AI development. Our participation with OMNY Health is a testament to our commitment to advancing patient care through responsible data innovation.”

“The healthcare industry has witnessed time and again how the power of collective knowledge and collaboration can improve the outlook on complex healthcare challenges. The COVID-19 pandemic was a stark reminder of the need for sharing de-identified data to close information gaps,” said Mark Townsend, MD, MHCM, Chief Clinical Digital Ventures Officer, Bon Secours Mercy Health and Accrete Health Partners. “We are proud to partner with OMNY as they continue to bring stakeholders together, ensuring that shared data drives innovation safely and responsibly.”

Brett A. Oliver, MD, Chief Medical Information Officer at Baptist Health System KY & IN, added, “Artificial intelligence is only as smart as the patchwork quilt of information we feed it. At Baptist Health, we’ve learned that real clinical insight—and true equity—emerges only when every thread of the patient story is woven into the model. That’s why OMNY Health’s drive to knit together a living, breathing data network isn’t just helpful; it’s the loom on which tomorrow’s breakthroughs will be spun. We’re thrilled to keep adding squares to that ever‑growing quilt.”

OMNY Health’s proprietary AI, Natural Language Processing (NLP), and Large Language Model (LLM) technologies are central to this achievement, transforming vast amounts of raw, often unstructured, clinical information found in clinical notes into research-ready variables. This unique capability allows researchers to uncover nuances buried in free-text clinician notes – details on symptoms, adverse events, treatment rationales, and social determinants of health – that are critical for a complete understanding of real-world patient experiences.

“Reaching 100 million patient lives is a monumental step forward in our mission to accelerate life-changing innovations through data-driven insights,” said Dr. Mitesh Rao, Founder and CEO of OMNY Health. “This milestone could not have been achieved without the ongoing support of our data partners. We’re incredibly grateful to collaborate with leading organizations that share our vision for advancing healthcare research. As we continue to grow, we look forward to expanding the platform to impact more lives and solve some of the most difficult problems in healthcare.”

OMNY Health plans to continue to expand its dynamic network of specialty health networks, hospitals, academic medical centers, and integrated delivery networks across the U.S., with the goal of achieving more than 125 million patient lives on the platform by the end of 2025. For more information on OMNY Health and its data platform, please visit www.omnyhealth.com.

###

About OMNY Health OMNY Health™ is the leading healthcare ecosystem for compliant real-world data insights at scale. OMNY Health connects patients, providers, and life sciences companies by transforming vast amounts of de-identified electronic health record data, clinical notes, and claims data into robust, research-ready insights. Leveraging proprietary AI, NLP, and LLM technologies, OMNY Health accelerates therapeutic innovation, optimizes clinical development, and enhances patient care. For more information, visit www.omnyhealth.com.

Categories
Product

Unlocking the Full Story: The Power of Clinical Notes in Real-World Data 

The year is 2025, or more than fifteen years since the enaction of The HITECH Act and Meaningful Use.  Almost all of the clinical data recorded from ordinary Americans’ physician office visits and hospital stays have now shifted to electronic format.  Therefore, increasing emphasis is gradually being placed on the value of real-world data, with the hope that medical knowledge resulting in care improvements can be extracted from the vast amount of information that exists in electronic health records.

Structured vs. Unstructured Clinical Data 

This electronic clinical data can be subdivided into two categories – structured and unstructured.  Examples of structured data include demographic information, diagnoses and procedures (in the form of clinical codes), medication prescription information, insurance records, and vital signs, while examples of unstructured data include free-text clinical narratives and imaging and test reports. Both types of clinical information are important and perform complementary functions in real-world data.  Structured data contains many basic data elements and is traditionally easier to process, due to its tabular nature.  However, unstructured data has been estimated to comprise 80% of clinical data by volume and often provides insights that are absent from structured clinical data and claims data [1].  There is an old saying among medical professionals that “90% of diagnoses can be made using the patient history, and 10% using the physical exam [2]” (notably, both elements are virtually absent from structured EHR data). What are some of the details captured in the well-written clinical note that are typically excluded from structured EHR data?

Information Extraction from Unstructured Data in an LLM-World 

A well-written clinical note contains many details about the patient that are absent from both structured tabular data and claims data.  Until just a couple of years ago, the challenge was extracting information from a clinical note into a usable format.  However, with the advent of large language models (LLMs), one can present a note as context and ask a favorite LLM questions about the note, such as “Where is the location of this patient’s pain?” or “Why did the patient discontinue lisinopril?”  Adaptation of this method enables extraction of information from the note as structured categorical data, which can then be used as structured data.

OMNY Notes: A First-of-its-Kind Clinical Notes Data Product  

OMNY Notes is one of our exciting new data products that makes billions of de-identified clinical notes from diverse health systems available to the end-user.   Researchers no longer must rely solely on structured EHR and claims data; they can now view the full patient journey with our HIPAA-compliant de-identified linked structured EHR, claims and notes solutions representing more than 75M individuals. No other solution available today provides the combined depth, breadth, and scale of OMNY structured and unstructured data to support improving quality, safety, and efficiency of healthcare delivery and overall public health. Contact us at info@omnyhealth.com to learn more about our OMNY Notes product and our other data products:
  • OMNY Foundation
  • OMNY Linked Claims
  • OMNY MedTech

References:

[1] https://healthtechmagazine.net/article/2023/05/structured-vs-unstructured-data-in-healthcare-perfcon.

[2] Tsukamoto, Tomoko, et al. “The contribution of the medical history for the diagnosis of simulated cases by medical students.” International Journal of Medical Education 3 (2012).

Categories
Product

Cognitive Impairment in Alzheimer’s Disease: Patient Characteristics and Treatments in the Real-World Setting

OMNY Health’s recent study is shedding light on the role of cognitive impairment in patients diagnosed with Alzheimer’s disease.   Alzheimer’s disease (AD) is the most frequent type of neurodegenerative disease that develops over several years. It is characterized by multiple cognitive deficits that progress over time, including memory deterioration. Newly approved monoclonal antibodies, unlike traditional medications that primarily relieve symptoms, have been shown to slow the progression of Alzheimer’s disease by approximately 30%.

At OMNY Health, researchers recently completed a retrospective analysis (2017 to 2024) of electronic health records from over 150,000 patients that had received Alzheimer’s disease care in the United States.  The study aimed to characterize the cognitive state of patients and to describe their treatment regimens.  OMNY’s researchers focused on nearly 7,000 Alzheimer’s disease patients in the dataset who had either the Montreal Cognitive Assessment (MoCA) or the Mini-Mental State Exam (MMSE) scores available.  

Key Findings  

The findings provide insight into the cognitive state of patients as they are first observed within the health system and receiving a diagnosis of Alzheimer’s disease.   On average patients were 78 years of age at the time of their first observed diagnosis.  Following the first observed diagnosis of Alzheimer’s, patients were categorized into four groups based on their first reported cognitive scores: normal, mild, moderate, and severe.  

The distribution of cognitive severity among patients was as follows: 

  • Normal: 15%  
  • Mild: 37%  
  • Moderate: 35%  
  • Severe: 13% 

OMNY’s research also found that with increasing cognitive impairment (normal, mild, moderate, severe), there was a monotonic rise in the proportions of female patients (51%, 60%, 64%, 67%) and nonwhite patients (15%, 17%, 22%, 27%). 

The study additionally assessed treatment patterns within 30 days of diagnosis. Many patients (59%) received cholinesterase inhibitors or NMDA receptor antagonists—therapies aimed at symptom management—with usage rates consistent across impairment levels. Fewer than 1% of patients had received treatment with newer monoclonal antibodies.

Why It Matters  

As disease-modifying therapies become more accessible, OMNY’s insights can support the research community in evaluating treatment effectiveness, informing clinical decision-making, and advancing understanding of disease progression across patient populations.

Categories
Product

Mining the Clinical Note: Extracting FEV1/FVC with LLMs for Scalable RWE

An OMNY Health Study on Severity Measure Extraction in Respiratory Care

 

Understanding disease severity is essential to supporting treatment decisions, yet many critical severity metrics—especially in respiratory conditions—are often buried in unstructured EHR notes. At ISPOR 2025, OMNY Health presented findings on how large language models (LLMs) can accurately extract FEV1/FVC scores, a core pulmonary function test (PFT) metric, directly from free-text clinical notes.

Why It Matters 

Clinical measures like FEV1/FVC are pivotal in evaluating lung function and diagnosing COPD and asthma. However, these measures are not consistently captured in structured EHR fields, making them difficult to access at scale. OMNY Health’s study explored whether retrieval-augmented LLMs could help close that gap—automating severity extraction while preserving accuracy. 

Study Overview

OMNY Health researchers sampled 50 random note excerpts containing the phrase “FEV1/FVC” from the OMNY Health platform and categorized them as:  

  • Simple (S): One FEV1/FVC score present 
  • No Value (NV): No score provided 
  • Complex (C): Multiple scores present 

OMNY Health researchers tested two Gemini LLMs (Flash and Pro) using a structured prompt and evaluated:

  • Accuracy in correct extraction 
  • Hallucination rate (false outputs when no score existed) 
  • Latency (processing time in slot milliseconds) 

Key Findings 

1. High Accuracy in Simple Notes 
Flash extracted FEV1/FVC scores with 90% accuracy, outperforming Pro (73.3%).

2. No Hallucinations in NV Notes  
Neither model generated phantom data when no score was present.

3. Pro Recognized Complex Context Better 
While Flash returned one of the multiple scores, Pro acknowledged the complexity—useful for nuanced documentation.

4. Flash Was Faster 
Flash processed data using 6,210 slot milliseconds versus Pro’s 23,576—offering speed advantages for scale. 

Example Outputs

Performance Snapshot

What’s Next 

This study highlights how LLMs can reliably extract clinical severity measures from real-world EHR data. Moving forward, the team plans to evaluate performance across more complex note structures, investigate how interpretability and robustness vary by model, and assess the trade-offs between cost, speed, and depth of output. The long-term vision is to expand this approach to other therapeutic areas where structured fields fall short—enabling deeper, more scalable evidence generation. 

Categories
News

OMNY Health Adds More Than 300 Health Measures to Enhance Disease Progression Research and Treatment Monitoring

The Company Now Offers Insights Across Ten Specialty Areas, Including Dermatology, Respiratory, Cardiovascular, Autoimmune, Gastroenterology, and Neurology

ATLANTAMay 8, 2025OMNY Health, the leading healthcare ecosystem for compliant real-world data insights at scale, today announced it has expanded its data network by adding more than 300 clinical assessment measures. These disease-specific measures include information derived from clinical severity indicators, surveys, and questionnaires, such as the Crohn’s Disease Activity Index (CDAI) and the Harvey Bradshaw index for Gastroenterology, as well as the Asthma Control Test and COPD Assessment Test (CAT) for Respiratory. These measures provide a more granular, real-world view of patient health, enhancing the ability to monitor disease progression and treatment impact.

The new measures incorporate data extracted and curated from structured and unstructured sources within OMNY Health’s extensive data pool of electronic health records (EHRs) to enhance the research and insights into medical specialties such as respiratory, cardiovascular, autoimmune, gastroenterology, neurology, and oncology. Adding these new pan-therapeutic measures builds on OMNY Health’s successful data strategy in dermatology, where it first demonstrated the value of leveraging disease-specific measures.

“For some time, OMNY Health has been seen as a dermatology data network, but with our network growth, we are moving beyond that and demonstrating the value of our solutions to the broader healthcare community,” said Dr. Mitesh Rao, CEO and Co-Founder of OMNY Health. “By leveling up research and making more information available across multiple specialties, OMNY Health enables teams to fill information gaps, accelerate research timelines, and make meaningful improvements to patient care.”

OMNY Health’s curated measures offering includes data on:

  • Longitudinal disease progression, flare-ups, and treatment responses for conditions such as asthma, COPD, Crohn’s disease, ulcerative colitis, rheumatoid arthritis, lupus, epilepsy, and multiple sclerosis
  • Real-world evidence on risk factors, medication adherence, and patient outcomes for conditions like heart failure and hypertension
  • Post-surgical recovery tracking, rehabilitation outcomes, and treatment effectiveness for various conditions

By capturing a broader range of clinical information across a larger and diverse set of provider organizations, OMNY Health’s measures provide deeper and more representative insights than traditional disease-specific registries, positioning the company as a go-to source for researchers seeking information and data.

“Democratizing access to longitudinal data that has both breadth and depth has been a longstanding challenge for healthcare researchers like myself,” said Dr. T. Y. Alvin Liu, Inaugural Director of the James P. Gills Jr. MD and Heather Gills Artificial Intelligence Innovation Center at Johns Hopkins Medicine. “As AI begins to leave its mark on the healthcare industry, these new measures from OMNY come at a pivotal time, guaranteeing the highest quality of data to researchers and actionable information to healthcare executives. By taking previously untapped data sources and turning them into resources for better care, we can provide more precise and accurate treatments to patients across care specialties and unparalleled insights for executives to make decisions on a system level.”

OMNY Health’s release of more than 300 new health measures to its data ecosystem follows the company’s announcement in March of the availability of  4 billion unstructured clinical notes. This enabled the company to support its customers in unlocking previously hidden insights, which will drive a better understanding of patient journeys and unleash new innovations in treatment options. For more information on OMNY Health and its data ecosystem, please visit www.omnyhealth.com.

About OMNY Health

OMNY Health™ is a national data ecosystem connecting the world of healthcare to fuel partnerships that improve clinical outcomes and drive patient care. OMNY’s dynamic partnerships with specialty health networks, hospitals, academic medical centers, and integrated delivery networks span all fifty states and cover over 85 million patient lives. The company’s comprehensive data layer powers health tech companies to drive the next generation of innovation. The platform serves as a centralized resource for life sciences and healthcare provider groups to facilitate mutually-beneficial data sharing and research collaboration at scale, fueling innovation where patients need it the most. OMNY Health’s data network now reflects more than eight years of historical data encompassing more than 4 billion clinical notes from 1 billion encounters, 500,000+ providers across 200+ specialties – and is growing. To learn more about how OMNY Health transforms lives and drives patient care by connecting providers and life sciences companies through data, visit www.omnyhealth.com.

Media Contact

SolComms

Caroline Rueve

OMNY@solcomms.co

847-609-4055

Categories
News

OMNY Health Becomes the First EHR Dataset Available on Datavant Connect Powered by AWS Clean Rooms

OMNY Health, the leading healthcare ecosystem for compliant real-world data insights at scale, today announced it has become the first electronic health record (EHR) company to offer its real world data (RWD) dataset on Datavant Connect powered by AWS Clean Rooms through its Lighthouse Partner Program.