Editorial Revisado por pares

The Big Health Data–Intelligent Machine Paradox

2018; Elsevier BV; Volume: 131; Issue: 11 Linguagem: Inglês

10.1016/j.amjmed.2018.05.038

ISSN

1555-7162

Autores

D. Douglas Miller,

Tópico(s)

COVID-19 diagnosis using AI

Resumo

This paradox plays out in many high-stakes global arenas—geopolitics, climate change, and financial markets, to name a few. Artificial intelligence thrives on massive disparate data sets, so it is not surprising that these data-dense megatrends are being shaped by artificial intelligence1Le Cun Y Bengio Y Hinton GE Deep learning.Nature. 2015; 521: 436-444Crossref PubMed Scopus (42113) Google Scholar Although most humans do not face election, live near receding glaciers, or trade in cryptocurrencies, we are all learning that when manmade forces and intelligent machines collide, the costs and consequences can be real. So it is with human health data. Humans share 1 inherent right and bear 1 attendant risk—good health. In an inexorably older and progressively sicker world,2Frischetti, M. Developing countries are battling diseases of the rich and the poor. Available at: https://www.scientificamerican.com/article/developing-countries-are-battling-diseases-of-the-rich-and-poor/. Accessed February 20, 2018.Google Scholar another potential cataclysm confronts patients and physicians—the silent explosion of health data. The uses and fates of these health data, which will have grown from an estimated 153 exabytes in 2013 to 2314 exabytes by 2020 in the United States alone, are already being influenced by artificial intelligence.3Beam AL Kohane IS Big data and machine learning in health care.JAMA. 2018; 319: 1317-1318Crossref PubMed Scopus (706) Google Scholar Medical students and trainees learn humanistic principles based on the Hippocratic precept, Primum non nocere (‘First do no harm’). Will humans use intelligent machines wisely, using big data responsibly for health care and medical training? Or will their power and immensity paradoxically produce unintended harms? The expanding digital health data universe is the unstoppable force saturating the Cloud with big data droplets. Individual personal health information resides in and fluxes through 2 types of data repositories. Administrative health care databases are operated by entities responsible for resourcing care and managing costs in socialized and quasi-market health insurance systems. Administrative health care databases are massive multigenerational payloads of demographic and utilization data (ie, pharmacy, physician, ambulatory, and hospital services)4Cadarette SM Wong L An introduction to health care administrative data.Can J Hosp Pharm. 2015; 68: 232-237PubMed Google Scholar linked to multiple electronic medical records (EMRs) and increasingly homed in Cloud platforms compliant with the Health Insurance Portability and Accountability Act. Despite informatician and analyst expertise, their complexity and the density of their data defy standard statistical methods, limiting their applications to health care process and utilization management. The EMR is the immovable object of health care—it's not going anywhere. EMR databases (and picture-archiving communication systems) are intended to help providers coordinate complex patient care. Federal meaningful use mandates now find documentation consuming ∼50% of health professionals’ working time.5Wachter R, Goldsmith J. To combat physician burnout and improve care, fix the electronic health record. Available at: https://hbr.org/2018/03/to-combat-physician-burnout-and-improve-care-fix-the-electronic-health-record. Accessed June 29, 2018.Google Scholar The computer technologies underpinning EMRs remain relatively antiquated, with user-unfriendly dropdown menus, free typing entries, cut-and-paste functions, and point-and-click navigation. In the era of health insurance payments only for what is documented, physicians (and increasingly ‘scribes’) are rewarded for overinclusivity and data redundancy in EMR entries, adding quantity without necessarily adding quality to bloated health records. Efforts to insert policy between the unstoppable force and immovable object have largely failed—miserably, some might say. Population health research on repository data intended to design better health care delivery has morphed into population management. Population management is derivative of policymakers’ desire to align provider behaviors through the allocation of resources within well-defined health populations in accountable care organizations or patient-centered medical homes. One reason public policy carrot-and-stick forays into the hypercompetitive health care business have fallen short is that private EMR vendors, health insurers, and pharmacy vendors wield tremendous big data market power. To paraphrase India's Prime Minister Narendra Modi, “They who control the data control the world.” Again, health care is no exception. Data integrity requires the migration and storage of deidentified anonymized data. Safeguards restrict repository access for researchers and data miners. Notwithstanding this, Apple and 13 prominent U.S. health systems recently announced plans to download EMR data (patient permission pending) onto Apple iCloud servers. Hailed by some as “truly disruptive” and “game-changing,”6Blumenthal, D, Chopra, A. Apple's pact with 13 health care systems might actually disrupt the industry. Available at: https://hbr.org/2018/03/apples-pact-with-13-health-care-systems-might-actually-disrupt-the-industry. Accessed June 29, 2018.Google Scholar the degree to which affected patients will own their individual data deposits remains unclear. Absent political courage and health insurer permission to grant citizens rights to own their individual personal health information for health benefits, the promise of personalized medicine remains adrift in big data space. In the profitable U.S. $570 billion health insurance sector (2.7% of the U.S. gross domestic product), it comes as no surprise to find the biggest insurers using self-learning artificial intelligence machines to mine their data troves. Insurers contend that artificial intelligence helps customers choose the right health plan (virtual assistants), professionals manage chronic diseases (biometric device tracking), and providers boost their star quality ratings (compliance-enhancing bot calls and text messaging). Recent health insurance market consolidations—vertical acquisitions (ie, Anthem's attempted Cigna purchase) and horizontal mergers (ie, CVS Health-Aetna, Cigna-Express Scripts, Walmart-Humana)—were pursued (in part) to accumulate valuable patient data. Mergers and acquisitions activity has also secured in-house artificial intelligence analytics capabilities for health systems (ie, Health Share of Oregon buying Health Catalyst) and insurers (Cigna buying Brighter AI). An industry-insider maxim is that artificial intelligence will “improve health insurance claims management” (ie, coverage eligibility, explanation of benefits, payments).7Hehner, S, Kors, B, Martin, M, et al. Artificial intelligence in health insurance. Available at: https://healthcare.mckinsey.com/sites/default/files/Artificial%20intelligence%20in%20Health%20Insurance.pdf. Accessed March 15, 2018.Google Scholar Insurers’ message to plan beneficiaries, whose big and little data are being scraped in Facebook-ian fashion, doesn't yet include lower premiums or better coverage. A 2017 industry survey8From mystery to mastery: unlocking the business value of artificial intelligence in the insurance industry. Available at: https://www2.deloitte.com/content/dam/Deloitte/de/Documents/Innovation/Artificial-Intelligence-in-Insurance-Whitepaper-deloitte-digital.pdf. Accessed March 15, 2018.Google Scholar showed that 40% of U.S. insurance customers would track and share their health behavior data for more accurate premiums. However, when the London National Health Service Trust trusted Google enough to provide its artificial intelligence subsidiary (DeepMind) with >1.6 million patients’ data, the privacy regulator determined that they had failed to comply with United Kingdom data protection laws. The immovable object and unstoppable force have converged and are hurtling Earthward. What can be done to save humanity from big health data annihilation? Achieving elusive EMR interoperability, impenetrable Cloud security, and actionable analytics would each be welcome relief. New artificial intelligence computing technologies poised on the launch pad offer more potent solutions. Simple recurrent neural networks are trained on large data sets (108-1010 elements) to carry forward recent data run solutions (at time t – 1) to impact current (time t) and future (time t + 1) outputs (Figure 1A). This time-spanning neural network ‘hidden layer’ back-propagation approach creates an algorithm-derived vector-weighted short-term memory that is well suited to sequential data input tasks1Le Cun Y Bengio Y Hinton GE Deep learning.Nature. 2015; 521: 436-444Crossref PubMed Scopus (42113) Google Scholar (Figure 1B). Recurrent neural networks can read EMR text entries to generate problem lists9Tsou, C-H, Devarakonda, M, Liang, JJ. Toward generating domain-specific / personalized problem lists from electronic medical records. Available at: https://www.aaai.org/ocs/index.php/FSS/FSS15/paper/viewFile/11733/11479. Accessed March 1, 2018.Google Scholar and translate digital images into meaningfully worded descriptive reports10Li X Jin Q Improving image captioning by concept-based sentence reranking.in: Chen E Gong Y Tie Y Advances in Multimedia Information Processing—PCM 2016.17th ed. Springer, New York, NY2016: 231-240Crossref Scopus (2) Google Scholar (Figure 2).Figure 2Researchers registered with CLEF dev set can access open-source captioned image databases (ie, plant specimen photos, digital medical images) and submit requests to use large training data sets to run artificial intelligence analytics.10Li X Jin Q Improving image captioning by concept-based sentence reranking.in: Chen E Gong Y Tie Y Advances in Multimedia Information Processing—PCM 2016.17th ed. Springer, New York, NY2016: 231-240Crossref Scopus (2) Google Scholar The upper run illustrates training a sentence-generating model on an ImageCLEF dev set. The lower run first trains the model on a Microsoft MS COCO image data set, then tests it on the ImageCLEF dev set. An algorithm scoring system (ie, METEOR) is used to assess the performance of different image-captioning software for concept detection (ie, a higher METEOR score equals better concept detection). Concept-based sentence re-ranking can then be applied on sentences generated by these LSTM-RNN models. The outcome sentence, “A plant with pink flower and brown stem…,” reflects the transformed hidden state description of the original image as generated by the neural network system. CLEF = Cross Language Evaluation Forum; CNN = convolutional neural network; LSTM = long short-term memory unit; METEOR = Metric for Evaluation of Translation with Explicit Ordering; MS COCO = Microsoft Coco framework; RNN = recurrent neural network.View Large Image Figure ViewerDownload Hi-res image Download (PPT) Memory networks have greatly expanded recurrent neural networks’ limited short-term memory into a tape-like ‘remember-the-story’ functionality for later data queries.11Weston J Chopra S Bordes A. Memory networks.https://arxiv.org/pdf/1410.3916v11.pdfGoogle Scholar Memory networks read a story, are trained to keep track of embedded data imagery, and can correctly answer questions like “Where is our story's heroine walking right now?” The game-changing question for medicine, “Where is my diabetic patient in her chronic disease trajectory?,” can now be answered by long short-term memory units that learn to gate the storage, release, or erasure of recurrent neural network hidden layer data, effectively linking causes to effects.12Pham T Tran T Phung D Venkatesh S Predicting healthcare trajectories from medical records: a deep learning approach.J Biomed Inform. 2017; 69: 218-229Abstract Full Text Full Text PDF PubMed Scopus (239) Google Scholar Memory networks and long short-term memory units will soon exert a powerful gravitational pull on both the unstoppable force and the immovable object. Despite cautionary human wisdom about artificial intelligence technology insertion,3Beam AL Kohane IS Big data and machine learning in health care.JAMA. 2018; 319: 1317-1318Crossref PubMed Scopus (706) Google Scholar, 13Miller DD Brown EW Artificial intelligence in medical practice: the question to the answer?.Am J Med. 2018; 131: 129-133Abstract Full Text Full Text PDF PubMed Scopus (290) Google Scholar perhaps the ultimate paradox is that the very same intelligent machines some fear will hurt jobs are the last best hope for saving the health care universe from entering deeper into a data explosion–provider documentation ‘black hole.’

Referência(s)