Exclusive: The Search for Reliable Health Information, Part IV

The reliability of the current state of health information is extremely limited.

EDITOR’S NOTE: Dr. Joseph Nichols is producing a five-part series on healthcare data for ICD10monitor. This is the fourth installment in his exclusive series.

This is the fourth in a series of five articles on the search for reliable health information. This article is focused on that component of the health information domain related to data analysis.  

As mentioned in the first article, the following are high-level requirements for data analysis:

Consistent, clearly defined methods of aggregation or classification.
Statistical validity.
Multifactorial testing of reporting results.
Transparency to analytic methods.
Safeguards against bias-driven analysis.

Consistent, clearly defined methods of aggregation or classification
Assuming that the requirements of data acquisition and data management have been met, the next step in the search for reliable health information requires aggregation of data. Data elements need to be grouped and organized in order to represent key concepts that are the focus of questions. For example, “diabetes” may be the focus of questions like:

What is the incidence of diabetes?
What types of complications are associated with diabetes?
Who gets diabetes?
What factors are associated with a potential cause of diabetes?
What types of treatment work best for diabetes?

The first thing that’s needed is a clear definition of diabetes that delineates how data is aggregated to the “diabetes bucket,” and in a way that is understood by all seeking the information. What should be included? What should be excluded? Should Type I and Type II diabetes be included? Should secondary diabetes and gestational diabetes be included? What codes are needed to represent the definition to ensure that we are capturing all of the intended data from coded data transactions?

Today, concepts like diabetes, heart failure, kidney failure, or any number of other conditions are included in analytic reports without any clear definition of the concept or definition of the criteria by which the source data was aggregated. Without a clear definition and method of aggregation, there is no reassurance that we are making consistent comparisons, or that the analysis we rely on represents what we think it represents. Too often, the job of data aggregation is left to the technical team, with very little guidance. 

Another good example of this issue is the grouping of “diabetes” in the hierarchal condition categories (HCCs) published by the Centers for Medicare & Medicaid Services (CMS). These are widely used categories in the industry for a variety of purposes, primarily risk assignment. However, there is no clear definition of “diabetes” within the HCCs.  Very few users of this data know that the HCCs for diabetes do not include diabetes in pregnancy, or even Type I and Type II diabetes in pregnant women. Few would know whether secondary diabetes is included in this HCC category. The values resulting from any analysis could vary dramatically, depending on the inclusion or exclusion of key data elements.

Potential Solutions

  • The definitions of all analyzed concepts should be complete, clear, consistently applied, published, maintained, and available to all potential users of data analytic reports.
  • The selection of coded data should include input from clinicians, coders, and IT professionals.
  • A repository of all data aggregation logic should be created, maintained, and made available for consistent analysis across all analytic efforts, or any applications that might use or reference the data aggregation methodologies.
  • Clearly defined taxonomies and ontologies should be developed, defined, and maintained as a service for all systems for any data aggregation or reporting purpose.

Statistical validity
We’ve all heard Mark Twain’s observation that there are “lies, damn lies, and statistics.” This might send a message that statistics are suspect, or of questionable value, but it simply means that statistics are often used inappropriately in an attempt to prove some desired answer, independent of the real answer. Statistics never provide an answer. Statistics simply provide a test to support that the data used passes a first mathematical step in getting at an answer. “Statistical significance” does not mean analytic significance. Statistics are an important part of analysis, but only as a numeric test of the data to determine if there is some viable use of that data in a meaningful analytic process.

Potential Solutions

  • Some level of statistical analysis should be done as an important first step in the analytic process. The validity of any reporting of trending, comparisons, or other analytic activities needs to meet a basic number validation as a floor.
  • Policies should be established for statistical requirements that should be a part of any analytic discipline.
  • Data governance should ensure that statistical manipulation is never used to bias the data analysis. As tempting as it may be to present two or three numbers and declare a trend that supports an argument, the definitions of “trends” need to be supported by statistical requirements.

Multifactorial testing of reporting results
Reporting is the platform for disseminating information that is used by decision-makers to make informed choices. Assuming that aggregation and statistical validity have been achieved, the next concern is that those disciplines are carried out in the implementation in a report.

Potential Solutions

  1. The reporting team must be a key part of the data aggregation and statistical effort to ensure the inclusion of all requirements into report writing.
  2. Testing should include reporting that relies upon multiple parameters, and should confirm that all requirements are met consistently.
  3. User testing should include user input about the consistent meaning of the report from the perspectives of multiple users. If key users cannot arrive at consistent conclusions from a report, the report content and design should be re-evaluated.

Transparency to analytic methods
The goal of any reporting is to ensure that reported values represent reliable information derived from complete and reliable data. All users should be able to arrive at as consistent an interpretation as possible, to ensure consensus on the best decisions for action, based on information that accurately reflects reality. 

Potential Solutions

  1. All documentation of report design, methodologies, logic, and statistical testing used to create the reported information should be available for all users.
  2. Definitions should be clear and complete, and documentation should be understandable at the user level.
  3. All users need to be trained in how to use available documentation as part of the decision-making process, and encouraged to present any clarifying questions to a reliable designated source.

Safeguards against bias-driven analysis
One of the biggest challenges facing health information – and for that matter, all information – is bias. There is a strong human tendency to seek information that confirms our own beliefs. Consciously or subconsciously, we tend to drive data and analysis toward the outcome we want, rather than determining the reality of what we should be measuring. It’s hard to objectively assess that which is inconvenient to our expectations. We will never learn anything if we just confirm what we think we know. The next, and last, article in the series will examine the challenging issue of bias.

Summary
Assuming that data acquisition and data management meet all of the requirements described in the preceding articles in this series, analysis must be done in such a way that data is aggregated and reported appropriately. Unless analytic reports support and lead to meaningful actions based on meaningful information, the wrong decisions may result, as well as risk of significant harm to entities, including patients, who rely on the accuracy of that information.

Programming Note: Listen to Dr. Nichols report this story live today during Talk Ten Tuesday, 10-10:30 a.m. EST.

 

Facebook
Twitter
LinkedIn

Related Stories

When in Doubt, Ask

Sometimes you want to be contacted by the US Department of Health and Human Services (HHS) Office of Inspector General (OIG).   I realize that

Read More

Where is the OCR?

The articles describe a significant 2026 dispute over the misuse of health information exchanged by asserting a treatment purpose through Carequality. (Raths) The core allegation

Read More

Leave a Reply

Please log in to your account to comment on this article.

Featured Webcasts

AI, Audits, and the Future of the Revenue Cycle

Artificial intelligence is rapidly transforming healthcare revenue cycle operations, from coding and auditing to compliance and denials. Join industry leaders Pam Warren (MaineHealth) and Raemarie Jimenez (AAPC) for a live fireside chat exploring how AI is changing workflows, workforce roles, payer-provider dynamics, and compliance risk—and what organizations should be doing now to prepare.

June 17, 2026

Trending News

Featured Webcasts

Ask Dr. Hirsch: Clarifying Medicare’s Most Misunderstood Rules – Part 2

Medicare regulations are complex and even seasoned professionals struggle to apply them consistently. Due to overwhelming demand, Dr. Hirsch returns for Part 2 of Ask Dr. Hirsch: Clarifying Medicare’s Most Misunderstood Rules to answer even more of Medicare’s most misunderstood questions, covering inpatient status, observation, SNF access, Medicare Advantage denials, and more. Join Dr. Hirsch as he provides clear, referenced answers to real-world questions submitted by your peers, helping you navigate Medicare compliance with confidence and clarity.

June 18, 2026

Reengineering Utilization Management: Building an Adaptive Model for the New Payer Era

Traditional utilization management models can no longer keep pace with regulatory shifts, payer scrutiny, and operational pressures. In this webcast, Tiffany Ferguson, LMSW, CMAC, ACM, ACPA-C, introduces an Adaptive Model strategy that modernizes UM through role specialization, technology-driven workflows, and proactive, team-based processes. Attendees will learn how to restructure programs to improve efficiency, strengthen clinical collaboration, and enhance financial performance in a rapidly changing healthcare environment.

May 20, 2026

Compliance for the Inpatient Psychiatric Facility (IPF-PPS): Minimizing Federal Audit Findings by Strengthening Best Practices

Federal auditors are intensifying their focus on inpatient psychiatric facilities, using advanced data analytics to spotlight outliers and pursue high‑dollar repayments. In this high‑impact webcast, Michael Calahan, PA, MBA, Compliance Officer and V.P., Hospital & Physician Compliance, breaks down what regulators are really targeting in IPF-PPS admissions, documentation, treatment and discharge planning. Attendees will learn practical steps to tighten processes, avoid common audit triggers and protect reimbursement and reduce the risk of multimillion-dollar repayment demands.

April 9, 2026

Mastering MDM for Accurate Professional Fee Coding

In this timely session, Stacey Shillito, CDIP, CPMA, CCS, CCS-P, CPEDC, COPC, breaks down the complexities of Medical Decision Making (MDM) documentation so providers can confidently capture the true complexity of their care. Attendees will learn practical, efficient strategies to ensure documentation aligns with current E/M guidelines, supports accurate coding, and reduces audit risk, all without adding to charting time.

March 31, 2026

Trending News

Prepare for the 2025 CMS IPPS Final Rule with ICD10monitor’s IPPSPalooza! Click HERE to learn more

Get 15% OFF on all educational webcasts at ICD10monitor with code JULYFOURTH24 until July 4, 2024—start learning today!

This Memorial Day, we honor those who gave all for our freedom. Take 20% off sitewide through May 29 with code MEMORIAL26 at checkout

CYBER WEEK IS HERE! Don’t miss your chance to get 20% off now until Dec. 1 with code CYBER25

CYBER WEEK IS HERE! Don’t miss your chance to get 20% off now until Dec. 2 with code CYBER24