{"id":10926,"date":"2014-05-28T07:52:52","date_gmt":"2014-05-28T11:52:52","guid":{"rendered":"http:\/\/n2value.com\/blog\/?p=10926"},"modified":"2014-05-28T07:52:52","modified_gmt":"2014-05-28T11:52:52","slug":"what-big-data-visualization-analytics-can-learn-from-radiology","status":"publish","type":"post","link":"https:\/\/n2value.com\/blog\/what-big-data-visualization-analytics-can-learn-from-radiology\/","title":{"rendered":"What Big Data visualization analytics can learn from radiology"},"content":{"rendered":"<p>As I research on part III of the \u201c<a title=\"What Medicine can learn from Wall Street \u2013 Part 2 \u2013 evolution of data analysis\" href=\"http:\/\/n2value.com\/blog\/what-medicine-can-learn-from-wall-street-part-2-evolution-of-data-analysis\/\">What Healthcare can learn from Wall Street<\/a>\u201d series, which is probably going to turn in to a Part III, Part IV, and Part V, I was thinking about visualization tools in big data and how to use them to analyze large data sets rapidly (relatively) by a human (or a deep unsupervised learning type algorithm) &#8211; and it came to me that us radiologists have been doing this for years.<br \/>\nIf you have ever watched a radiologist reading at a PACS station (a high-end computer system which displays images quickly) you will see them scroll at a blindingly fast speed through a large series of multiple anatomic images to arrive at a diagnosis or answer a specific question. \u00a0[N.B. if you haven\u2019t, you really should &#8211; it&#8217;s quite cool!] \u00a0Stacked upon each other, these images assemble a complete anatomic picture of the area of data acquisition.<\/p>\n<p>What the radiologist is doing while going over the images is comparing the expected appearance of a reference standard to that visualized image to find discrepancies. \u00a0The data set looks like THIS:<\/p>\n<p style=\"padding-left: 30px;\"><a href=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/CT-scan-segmentation.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-10922\" src=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/CT-scan-segmentation.jpg\" alt=\"CT scan segmentation\" width=\"405\" height=\"261\" srcset=\"https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/CT-scan-segmentation.jpg 405w, https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/CT-scan-segmentation-300x193.jpg 300w\" sizes=\"auto, (max-width: 405px) 100vw, 405px\" \/><\/a><a href=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/voxel.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft wp-image-10924\" src=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/voxel.jpg\" alt=\"voxel\" width=\"169\" height=\"160\" srcset=\"https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/voxel.jpg 305w, https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/voxel-300x284.jpg 300w\" sizes=\"auto, (max-width: 169px) 100vw, 169px\" \/><\/a>It&#8217;s important to understand that each pixel on the screen represents not a point, but a volume, called a voxel. \u00a0The reconstruction algorithms can sometimes over or under emphasize the appearance of the voxel, so the data is usually reconstructed in multiple axes. \u00a0This improves diagnostic accuracy and confidence.<\/p>\n<p>Also, the voxel is not a boolean (binary) zero or one variable &#8211; it is a scalar corresponding to a grey-scale value.<\/p>\n<p>So, in data science thinking, what a radiologist is doing is examining a four-dimensional space (X,Y,Z, voxel grayscale) for relevant patterns and deviance from those patterns (Essentially a subtractive algorithm). \u00a0A fifth dimension can be added by including changes over time (comparison to a previous similar study at some prior point in time).<\/p>\n<p>Rapid real-time pattern recognition in five variables on large data sets. \u00a0Done successfully day-in and day-out visually by your local radiologist.<\/p>\n<p>&nbsp;<\/p>\n<p>Initial evaluation of a complex data set can give you something like this multiple scatter plot which I don&#8217;t find too useful:<\/p>\n<p><a href=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/Multiple-scatter-plots.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-10923\" src=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/Multiple-scatter-plots.png\" alt=\"Multiple scatter plots\" width=\"413\" height=\"382\" srcset=\"https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/Multiple-scatter-plots.png 413w, https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/Multiple-scatter-plots-300x277.png 300w\" sizes=\"auto, (max-width: 413px) 100vw, 413px\" \/><\/a><\/p>\n<p>Now, this data set, to me with my orientation and training, becomes much more useful:<\/p>\n<p><a href=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/3D-dataset.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-10921\" src=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/3D-dataset.png\" alt=\"3D dataset\" width=\"480\" height=\"329\" srcset=\"https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/3D-dataset.png 480w, https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/3D-dataset-300x205.png 300w\" sizes=\"auto, (max-width: 480px) 100vw, 480px\" \/><\/a>A cursory visual inspection yields a potential pattern, the orange circles, which to me suggests a possible model drawn in blue.\u00a0 <a href=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/visuallyevaluated.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-10925\" src=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/visuallyevaluated.jpg\" alt=\"visuallyevaluated\" width=\"480\" height=\"329\" srcset=\"https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/visuallyevaluated.jpg 480w, https:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/visuallyevaluated-300x205.jpg 300w\" sizes=\"auto, (max-width: 480px) 100vw, 480px\" \/><\/a>That curve looks parabolic, which suggests a polynomial linear model might be useful for describing that particular set of data, so we can model it like this and then run the dataset in R to prove or disprove our hypothesis.<br \/>\n<a href=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/CodeCogsEqn.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-10930\" src=\"http:\/\/n2value.com\/blog\/wp-content\/uploads\/2014\/05\/CodeCogsEqn.gif\" alt=\"Polynomial Linear Model\" width=\"304\" height=\"38\" \/><br \/>\n<\/a>So, what I&#8217;m suggesting here is that by visually presenting complex data in a format of up to five dimensions (three axes, X, Y,Z, a point with grayscale corresponding to a normalized value, and a fifth, comparative dimension) complex patterns can be visually discovered, potentially quickly and on a screening basis, and then appropriate models can be tested to discover if they hold water.\u00a0 I&#8217;ll save the nuts and bolts of this for a later post, but when a large dataset is evaluated (like an EHR) dimension reduction operations can allow focusing down on fewer variables to put it into a more visualization-friendly dataset.<\/p>\n<p>And I&#8217;m willing to bet even money that if an analyst becomes intimately familiar with the dataset and visualization, as they spend more time with it and understand it better, they will be able to pick out relationships that will be absolutely mind-blowing.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>As I research on part III of the \u201cWhat Healthcare can learn from Wall Street\u201d series, which is probably going to turn in to a Part III, Part IV, and Part V, I was thinking about visualization tools in big data and how to use them to analyze large data sets rapidly (relatively) by a [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"New N2value post: What #BigData visualization analytics can learn from #radiology http:\/\/wp.me\/p4mtfP-2Qe","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false},"version":2}},"categories":[4,2,6],"tags":[],"class_list":["post-10926","post","type-post","status-publish","format-standard","hentry","category-data-science","category-healthcare","category-process-analytics"],"jetpack_publicize_connections":[],"aioseo_notices":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p4mtfP-2Qe","jetpack_sharing_enabled":true,"jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/posts\/10926","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/comments?post=10926"}],"version-history":[{"count":14,"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/posts\/10926\/revisions"}],"predecessor-version":[{"id":10971,"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/posts\/10926\/revisions\/10971"}],"wp:attachment":[{"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/media?parent=10926"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/categories?post=10926"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/n2value.com\/blog\/wp-json\/wp\/v2\/tags?post=10926"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}