Categories
News Product

Five Natural Language Processing Subtasks that Large Language Models Can Perform to Improve Healthcare

In late 2022, the technology world was turned upside down as OpenAI released ChatGPT, its new artificial intelligence (AI) model.  Unlike most previous natural language processing (NLP) models, this model contained billions of parameters, was trained on a corpus of unstructured data with unprecedented size, and underwent a novel alignment process to better orient the model towards human needs.  The result was a model that provided superior performance for various NLP-related tasks and professional certification exams (including medical ones) and could produce realistic-sounding text that even the most hardened AI cynics could not deny.  Technological and legal impacts were felt almost immediately. Many technology companies began to invest in the new field of generative artificial intelligence (genAI) with an eye toward the potential for financial benefits. 

However, lost in all the excitement, is the fact that genAI relies largely on the bread-and-butter NLP methods and subtasks that have supported and improved many industries, including healthcare, over the past decade or two, with the additional capability of text generation.  Unlike previous NLP models, newer large language models (LLMs) can perform many different NLP subtasks with one single model. 

So, what are the different subtasks that a state-of-the-art LLM can perform? Without further ado, here are five NLP subtasks that can be programmed into LLM to improve healthcare:

Text Classification

Text classification, or the task of distributing text into categories, is often seen as the most primitive NLP subtask.  A classic non-healthcare example of this subtask is categorizing reviews (e.g., critiquing products or movies) into having positive or negative sentiments. Text can be classified at the sentence or paragraph level, depending on the use case.   

In healthcare, the clinical notes for patients can be classified as to whether they identify patients as having certain attributes or disease characteristics.  Using that information, appropriate personalized interventions and treatments can then be used to improve health.  Over the past year, work at OMNY Health has focused on several text classification projects, including the following:

  1. Classifying generalized pustular psoriasis (GPP) patients or patients with a rare, devastating form of the skin disease psoriasis, by their disease status (flare versus non-flare) in collaboration with a life sciences partner. 
  2. Classifying psoriasis patients as to whether they received joint assessments and/or rheumatology referrals in the presence of psoriatic arthritis symptoms, in collaboration with life sciences partners. 
  3. Identifying patients adversely affected by social determinants of health, including illiteracy and financial and housing insecurity. 

The results of the study are available (View Report).

Named Entity Recognition

Also known as token classification, named entity recognition (NER) involves classifying individual words or phrases as reflective of an entity, for example, a person, place, or organization.  It differs from text classification in that every word is classified, unlike text classification in which sentences and paragraphs are classified.

In healthcare, common entities detected include symptoms, medications, and protected health information (PHI). PHI may be related to individuals, such as names, addresses, and contact information, in larger efforts to deidentify clinical notes before their input in an NLP model.  In fact, at OMNY Health, more than 20 PHI entities are identified and removed from clinical text before NLP to protect patient privacy and to comply with the Health Insurance Portability and Accountability Act (HIPAA) of 1996.

Relation Extraction

Relation extraction (RE) represents a relatively more complex subtask in which the relationships between various entities are determined.  For example, if two “person” entities are detected in the text, an RE task may be used to identify whether the two entities are married.  It is usually performed downstream of an NER step.  Previously, RE tasks often required expensive annotation and labeling processes to be performed successfully, although this requirement has faded with the advent of newer LLMs that can label and annotate notes automatically.

In healthcare, RE applications include identifying patient-doctor relationships for clinical note understanding, as well as more general clinical RE to model medical knowledge ontologies such as the Systematic Nomenclature in Medicine – Clinical Terms (SNOMED-CT).  At OMNY Health, work has been performed to identify relations between detected symptom and medication entity-pairs to determine if they comprised an adverse drug event (ADE).

Closed-domain Question Answering

Closed-domain question answering is an NLP subtask in which a text is paired with a question about the text, like a reading comprehension question on a standardized exam.  The question-context pair is then passed to an NLP model, which then extracts the answer to the question from the text.

In healthcare, question-answering pipelines can be performed to extract practically any conceivable information about a patient from clinical notes, when paired with the correct prompt.  The answer can then be processed and refined using additional NLP steps.  As an example, a clinical note about medication discontinuation could be passed as input to an NLP model paired with the question, “Why was the medication discontinued?”  The answer could then be passed to a text classification model which categorizes the reason for treatment discontinuation.  At OMNY Health, we have constructed such a pipeline to determine reasons for treatment discontinuation for various medications (View Report).  A second application of question-answering is the extraction of severity scores from clinical notes for various diseases.

Text Generation

Finally, text generation is the process of composing new, original text in response to a provided prompt.

The basic mechanism revolves around predicting the next word based on the input prompt and the sequence of previously generated words.  As alluded to in the introduction, modern LLMs undergo alignment to prevent toxic, irrelevant, and/or unhelpful text from being generated. This has resulted in improved output.  Text generation can be used for open-domain question answering, in which a general prompt is provided with no corresponding answer in the context (e.g. “What is the purpose of life?”).  Text generation is also used on the downstream end of retrieval-augmented generation pipelines, in which a query first causes the NLP pipeline to filter the relevant documents in a database, extract an answer, and then generate text to present the answer to the user.

In healthcare, potential applications of text generation include helping physicians write clinical notes to ease documentation burden, as well as simplifying routine tasks for health analysts at pharmaceutical companies.  A true challenge is to accomplish these use cases while protecting patient privacy and preventing any PHI leakage in responses to prompts.  At OMNY Health, we strive to accomplish these and related use cases responsibly.

Conclusion

In conclusion, there is more to LLMs than generating text.  Combining text generation functionality with the subtasks mentioned in this article potentiates LLMs to complete powerful tasks that can help improve healthcare.

Categories
Product

Accelerate the Pace of Life-Changing Innovation

Hidden deep in healthcare data lies powerful insights and clinical revelations. Knowledge, that when shared with healthcare visionaries, can spark actions that will accelerate the pace of life changing innovations.  Curating this wealth of knowledge requires precise data handling and a sharp focus on balancing two important factors:  Patient Privacy and Care Innovation.  

OMNY Health’s technology solutions deliver immediate value to Life Sciences researchers by delivering standardized patient, encounter, notes, and supply data from a diverse set of providers in a secure cloud platform. With off-the-shelf data, dashboards, reports, and analytics available in a variety of formats and combinations, time to insights and decisions accelerate, so your team can focus on improving access to effective treatments and transforming the lives of patients. OMNY works closely with integrated delivery networks, community hospital systems, and even specialty practices to harmonize their data, enable custom analytics and create a real-word data pipeline. 

These clinical insights are shared with Life Sciences companies for use in expediting Research & Development, driving new therapy launches and accelerating commercialization strategies. And while providers and pharmaceutical innovators benefit significantly from the standardization and flow of useful data, the ultimate winners are the patients.

Here’s an example:

Working with 2 health systems, we uncovered insights that directly informed treatment decisions for Myasthenia Gravis, or MG, patients and powered pre-launch research for a new innovative MG drug . What’s more, we provided real-world context to understand MG therapy burden and barriers to care.

Healthcare data holds the promise of clinical innovation. OMNY can connect you with the quality data you need to inform and transform actions that will make a real difference in patients’ lives. The future of healthcare is accessible today.

 To learn more about this story and how to partner with OMNY Health, please contact us.

Categories
Product

How OMNY Health™ Is Supporting the Fight Against MG

OMNY Health™ worked closely with an innovative pharmaceutical company to support the launch of its Myasthenia Gravis (MG) drug by providing data from two of its health system partners.

Compiling in-depth information from more than 30 of our partnered medical facilities, we delivered a snapshot of the real-world healthcare encounters and treatment experience for over 1,700 patients via a secure, de-identified dataset. The OMNY Health Insights professional services team delivered dashboards characterizing the patient journey from initial interaction points to ultimate diagnosis and treatment, a solution that the pharma company’s claims data supplier could not provide. 

Of note, the insights collected enabled deeper insights into both inpatient and outpatient settings, as well as the broad spectrum of medical specialists delivering the care. Not surprising, variation in care was identified across physician specialty.

Armed with this data, our customer has optimized access to its product for patients in need. The insights our team provided allowed their global rare disease experts to better understand patient journeys without compromising privacy. 

Funds from this engagement have already routed back to the participating health systems for reinvestment, empowering them to hire more staff, improve frontline staff salaries, address budget challenges, and invest in new medical equipment. 

To learn more about this story and how to partner with OMNY Health, please contact us.

Categories
Product

OMNY Health’s Provider Leadership Council

OMNY Health™ is establishing a new Provider Leadership Council (PLC), comprised of executives from leading health systems and specialty networks interested in collaborating on opportunities to leverage real-world data and evidence to address the strategic imperatives facing the U.S. healthcare system.

The intent of the PLC is grounded in our vision for OMNY Health – to support providers in extracting value from their real-world data assets that frequently reside across disparate, fragmented information systems.  

We are inviting health system executives from large integrated networks and specialty medical groups to join us and help drive better evidence-based decisions across the industry. 

We believe this initiative will help health systems who are looking to address issues such as health equity, value-based care reimbursement, adapting to telemedicine/telehealth, as well as enabling patient-centered care initiatives.  Data and analytics will play a critical role in planning, executing, and evaluating the impact of approaches focused on tackling these challenges. 

The benefits of participating in the Provider Leadership Council include:

  • Early access to new features and functionality on the OMNY Health Platform
  • Opportunities to collaborate with OMNY Health on product strategy initiatives
  • Research initiatives to drive additional clinical research opportunities to provider institutions
  • Access to cutting edge technology such as AI and blockchain
  • Networking opportunities with opinion leaders and industry experts

OMNY Health plans to kick off the PLC in March 2021 (updated).  If you are a health system leader who would like to get involved or simply learn more, please contact us today.