The Importance of Understanding Data Before Using it

The Importance of Understanding Data Before Using it

I was recently performing some analysis using cost report data. At first pass, it looked like hospitals in the United States had made a huge profit, according to cost reports filed in 2021. Looking deeper, however, I noticed a few hospitals with net income in 2021 many times larger than in 2020. Unreasonably larger. In digging, I noted a few hospitals had not input correct data.

It is a common occurrence when working with large datasets, particularly those that rely on self-reported or manually entered information, to find issues with the data that impacts analysis.

Here are a few steps you can take before and after you find issues with data you are using:

  1. Data Validation: Implement some validation rules on your analysis to identify potential errors in the data. This could be as simple as looking for values that are several standard deviations away from the mean, or comparing changes in net income year over year and flagging those that exceed a certain threshold.
  2. Contact the Data Source: If the data comes from a specific provider, like a government agency or a private company, you might want to reach out to them to notify them of the potential errors. They may be able to correct the issue at the source, or at least provide you with more accurate data. In this case, it would involve mentioning the issue to the Centers for Medicare & Medicaid Services (CMS).
  3. Data Cleaning: Depending on the nature and scale of the inaccuracies, you might be able to correct them yourself. If it’s just a few outliers, you might be able to replace them with more reasonable estimates. If it’s a systemic issue, you might need to apply some kind of transformation to the data to account for it.
  4. Sensitivity Analysis: In some cases, you might want to run your analysis both with and without the questionable data to see how much it affects your results. This can give you a sense of how sensitive your conclusions are to these potential errors.
  5. Disclose Uncertainty: Finally, when you present your results, be sure to disclose these potential inaccuracies. It’s always better to be upfront about the limitations of your analysis than to overstate your confidence in the results.

Remember that the goal of data analysis is not to produce the perfect answer, but to make the best possible use of the information available. By being diligent and transparent in your work, you can help ensure that your conclusions are as reliable and useful as possible.

Always tell anyone you provide analysis to how you got the data and whether you had to adjust the data.

Facebook
Twitter
LinkedIn

Timothy Powell, CPA, CHCP

Timothy Powell is a nationally recognized expert on regulatory matters, including the False Claims Act, Zone Program Integrity Contractor (ZPIC) audits, and U.S. Department of Health and Human Services (HHS) Office of Inspector General (OIG) compliance. He is a member of the RACmonitor editorial board and a national correspondent for Monitor Mondays.

Related Stories

Compliance as a Revenue Strategy

Compliance as a Revenue Strategy

Grab your pens, alert your billing teams, and cue up your electronic medical record (EMR) vendors, because the Centers for Medicare & Medicaid Services (CMS)

Read More

Leave a Reply

Please log in to your account to comment on this article.

Featured Webcasts

The Cost of Ignoring Risk Adjustment: How HCCs Impact Revenue & Compliance

The Cost of Ignoring Risk Adjustment: How HCCs Impact Revenue & Compliance

Stop revenue leakage and boost hospital performance by mastering risk adjustment and HCCs. This essential webcast with expert Cheryl Ericson, RN, MS, CCDS, CDIP, will reveal how inaccurate patient acuity documentation leads to lost reimbursements through penalties from poor quality scores. Learn the critical differences between HCCs and traditional CCs/MCCs, adapt your CDI workflows, and ensure accurate payments in Medicare Advantage and value-based care models. Perfect for HIM leaders, coders, and CDI professionals.  Don’t miss this chance to protect your hospital’s revenue and reputation!

May 29, 2025
I050825

Mastering ICD-10-CM Coding for Diabetes and it’s Complications: Avoiding Denials & Ensuring Compliance

Struggling with ICD-10-CM coding for diabetes and complications? This expert-led webcast clarifies complex combination codes, documentation gaps, and sequencing rules to reduce denials and ensure compliance. Dr. Angela Comfort will provide actionable strategies to accurately link diabetes to complications, improve provider documentation, and optimize reimbursement—helping coders, CDI specialists, and HIM leaders minimize audit risks and strengthen revenue integrity. Don’t miss this chance to master diabetes coding with real-world case studies, key takeaways, and live Q&A!

May 8, 2025
2025 Coding Clinic Webcast Series

2025 ICD-10-CM/PCS Coding Clinic Update Webcast Series

Uncover critical guidance. HIM coding expert, Kay Piper, RHIA, CDIP, CCS, provides an interactive review on important information in each of the AHA’s 2025 ICD-10-CM/PCS Quarterly Coding Clinics in easy-to-access on-demand webcasts, available shortly after each official publication.

April 14, 2025

Trending News

Featured Webcasts

The Two-Midnight Rule: New Challenges, Proven Strategies

The Two-Midnight Rule: New Challenges, Proven Strategies

RACmonitor is proud to welcome back Dr. Ronald Hirsch, one of his most requested webcasts. In this highly anticipated session, Dr. Hirsch will break down the complex Two Midnight Rule Medicare regulations, translating them into clear, actionable guidance. He’ll walk you through the basics of the rule, offer expert interpretation, and apply the rule to real-world clinical scenarios—so you leave with greater clarity, confidence, and the tools to ensure compliance.

June 19, 2025
Open Door Forum Webcast Series

Open Door Forum Webcast Series

Bring your questions and join the conversation during this open forum series, live every Wednesday at 10 a.m. EST from June 11–July 30. Hosted by Chuck Buck, these fast-paced 30-minute sessions connect you directly with top healthcare experts tackling today’s most urgent compliance and policy issues.

June 11, 2025
Open Door Forum: The Changing Face of Addiction: Coding, Compliance & Care

Open Door Forum: The Changing Face of Addiction: Coding, Compliance & Care

Substance abuse is everywhere. It’s a complicated diagnosis with wide-ranging implications well beyond acute care. The face of addiction continues to change so it’s important to remember not just the addict but the spectrum of extended victims and the other social determinants and legal ramifications. Join John K. Hall, MD, JD, MBA, FCLM, FRCPC, for a critical Q&A on navigating substance abuse in 2025.  Register today and be a part of the conversation!

July 16, 2025

Trending News

Happy National Doctor’s Day! Learn how to get a complimentary webcast on ‘Decoding Social Admissions’ as a token of our heartfelt appreciation! Click here to learn more →

CYBER WEEK IS HERE! Don’t miss your chance to get 20% off now until Dec. 2 with code CYBER24