<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Neurostats: Surrogate Dispatches]]></title><description><![CDATA[Surrogate dispatches offers short form commentary about surrogate models, markers, and metrics used as evidence for decision-making in biomedical research. ]]></description><link>https://blog.neurostats.org/s/surrogate-dispatches</link><image><url>https://substackcdn.com/image/fetch/$s_!4sfZ!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39a72a63-441e-4005-8a8e-a0b07b5ab602_1024x1024.png</url><title>Neurostats: Surrogate Dispatches</title><link>https://blog.neurostats.org/s/surrogate-dispatches</link></image><generator>Substack</generator><lastBuildDate>Mon, 18 May 2026 06:34:46 GMT</lastBuildDate><atom:link href="https://blog.neurostats.org/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Manjari Narayan]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[neurostats@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[neurostats@substack.com]]></itunes:email><itunes:name><![CDATA[Manjari Narayan]]></itunes:name></itunes:owner><itunes:author><![CDATA[Manjari Narayan]]></itunes:author><googleplay:owner><![CDATA[neurostats@substack.com]]></googleplay:owner><googleplay:email><![CDATA[neurostats@substack.com]]></googleplay:email><googleplay:author><![CDATA[Manjari Narayan]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Request for Information: Synthetic Datasets]]></title><description><![CDATA[Share how you validate synthetic data as a proxy for the real data]]></description><link>https://blog.neurostats.org/p/open-thread-synthetic-datasets</link><guid isPermaLink="false">https://blog.neurostats.org/p/open-thread-synthetic-datasets</guid><dc:creator><![CDATA[Manjari Narayan]]></dc:creator><pubDate>Fri, 14 Nov 2025 18:28:43 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/165228a6-7af4-4957-80e2-4ecc4f0f0f13_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>TL;DR; If you know something about this, have written or read papers on this please comment.</em> </p><p>There are broadly two approaches to analyze data and distribute results when the data is not available to share for legal and privacy reasons. </p><ul><li><p>Federated learning</p></li><li><p>Obscure the data via differential privacy or synthetic data generation</p></li></ul><p><strong>Where is the normative theory for when synthetic data is trustworthy?</strong> </p><p>There are no universal surrogate endpoints for therapeutic evaluation, because there is no way to produce a universal surrogate for every possible decision. If something is a good proxy, it has been shown to be a good proxy for a particular purpose. The same goes for synthetic datasets when used in-lieu of the real thing. How do you just generate synthetic data in a vacuum or pre-emptively before knowing what decisions are going to be made on it?</p><p>The only one I really understand is differential privacy due to its theoretical origins, but there are a wide variety of synthetic data generation schools of thought and approaches today. In practice, many of them are based on intuitive heuristics. But how do we what they are good for and when such data might be misleading?</p><ul><li><p>Counterfactual imputation of missing data is &#8220;synthetic&#8221; data</p></li><li><p>LLM benchmarks: take a seed template problem and using language models to scale up many variations of the problem. This makes problems realistic but &#8220;synthetic&#8221; in that they are not written up by humans as in FrontierMath.</p></li><li><p>Generative models trained on large biological datasets as a source of new synthetic observations. We see this right now in both brain imaging research and genomics. </p></li></ul><p><strong>In short, how do we know that a synthetically generated dataset produces optimal and valid answers for every kind of scientific question and decision that could be made on it? </strong></p><p></p>]]></content:encoded></item><item><title><![CDATA[Biomarker Qualification]]></title><description><![CDATA[Qualification is to regulators what validation is to scientists]]></description><link>https://blog.neurostats.org/p/biomarker-qualification</link><guid isPermaLink="false">https://blog.neurostats.org/p/biomarker-qualification</guid><dc:creator><![CDATA[Manjari Narayan]]></dc:creator><pubDate>Mon, 18 Aug 2025 20:00:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b1365ac0-a30a-48a3-ae36-3000c76dac6f_946x834.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Definition: Biomarker qualification is about thinking through the full chain of evidence to prove that a biomarker can be used for a particular clinical decision. </em></p><p>HDL serum cholesterol, for instance, is great for evaluating risk of heart disease but not for evaluating effectiveness of treatments to improve cardiovascular health. There is no such thing as a &#8220;good biomarker&#8221; in a vacuum. Decisions to use biomarkers are always dependent on the intended applications.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Sadly, this is not something that most biomedical researchers think about when they do biomarker discovery. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.neurostats.org/glossary&quot;,&quot;text&quot;:&quot;Back to glossary&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://blog.neurostats.org/glossary"><span>Back to glossary</span></a></p><div><hr></div><p>Biomarker guided therapeutic decisions require developing and validating biomarkers. Specifying what these criteria are requires constant meta-scientific innovation. It is easy to conflate the enterprise of biomarker validation with analytical validation<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> , followed by reproducible clinical studies. But what constitutes clinical validation?</p><p>Analytical validation is all about evaluating the measurement process or assay. Many biomarker discovery studies will demonstrate test-retest reliability that looks for whether people can be reliably differentiated based on their biomarker measurements. One has to evaluate a far more thorough checklist of measurement issues that go well beyond test-retest reliability for analytical validation. We also need repeatability of measurements for any individual with good tolerance intervals, comparability of quantitative measurements in a wide variety of circumstances and many others. Yet, analytical validation is the easier part of the biomarker evaluation process with systematic criteria. <em><strong>Qualification</strong></em> on the other hand encompasses the full spectrum of validity problems across all the life-medical-health sciences &#8212;&nbsp;it includes all the possible &#8220;<em>does it mean what you think it means</em>&#8221; problems. Biomarker qualification includes assessing the clinical validity<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> of the biomarker as well as other validation criteria specific to a therapeutic decision.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> </p><p>Importantly, there is no easy way to preemptively specify what all the threats to validity are &#8212; construct validity, causal validities including internal validity and external validity, all the modern validities beyond reliability of the measurement that link it to disease and/or therapeutic outcomes. </p><p></p><h4><strong>Altar et. al. (2008)</strong></h4><p>Here is a concrete example of what a comprehensive understanding of biological and clinical validation looks like. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!behi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!behi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png 424w, https://substackcdn.com/image/fetch/$s_!behi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png 848w, https://substackcdn.com/image/fetch/$s_!behi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png 1272w, https://substackcdn.com/image/fetch/$s_!behi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!behi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png" width="1456" height="1266" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1266,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:415089,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.neurostats.org/i/172505027?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!behi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png 424w, https://substackcdn.com/image/fetch/$s_!behi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png 848w, https://substackcdn.com/image/fetch/$s_!behi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png 1272w, https://substackcdn.com/image/fetch/$s_!behi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbba5868a-ea9f-4396-869b-f5a01a475a0a_1472x1280.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Credit: Altar, C.A. <em>et al.</em> (2008) &#8216;A prototypical process for creating evidentiary standards for biomarkers and diagnostics&#8217;, <em>Clinical pharmacology and therapeutics</em>, 83(2), pp. 368&#8211;371. https://doi.org/10.1038/sj.clpt.6100451.</p><p></p><p>However, this table reflects 20th century understanding. It could use significant updating given how far scientific and statistical methodology has come in 20 years. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.neurostats.org/p/biomarker-qualification?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.neurostats.org/p/biomarker-qualification?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><div><hr></div><h3>Checklist for a biomedical stakeholder</h3><p>Analytical validation and biomarker qualification are terms of art when biomarkers are proposed for drug development decisions in clinical trials. Unfortunately, these terms are not widely used within mainstream biomedical research. It is easy to think these are regulatory concerns don&#8217;t matter until one wants to bring biomarkers to the clinic, as opposed to scientific concerns that need to be addressed. Every research community has its own epistemic norms around &#8220;validation&#8221;. When you visit premier conferences in different niches of life science where biomarker research occurs, these differences become apparent. No one actually owns the problem of understanding the full scope of scientific R&amp;D that needs to occur.</p><p>If you read an article that calls for a large-scale validation for new biomarkers, here is what you should ask yourself &#8212; </p><ol><li><p><em>Is it clear that the biologist or scientist&#8217;s notion of validation is distinct from analytical validation? Has it at least considered all the problems in Altar 2008, for instance?  </em></p></li><li><p><em>Does the roadmap for biological plausibility and clinical validation cover the full spectrum of research designs and grades of evidence that need to be generated? </em></p></li><li><p><em>Does it address all threats to scientific validity known to methodologists for a particular decision or context of use, even outside one disease area?</em></p></li></ol><p>If not, then the field might need a better specification of the validation roadmap. It is far too late to do the necessary R&amp;D if you wait until someone is ready to initiate conversations with the FDA. </p><p></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.neurostats.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.neurostats.org/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Institute of Medicine. 2010. Evaluation of Biomarkers and Surrogate Endpoints in Chronic Disease. Washington, DC: The National Academies Press. https://doi.org/10.17226/12869.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p><a href="https://www.fda.gov/media/161201/download">FDA on analytical validation</a>, <a href="https://www.ema.europa.eu/en/documents/scientific-guideline/ich-guideline-q2r2-validation-analytical-procedures-step-2b_en.pdf">ICH on analytical validation</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Ransohoff, D. Bias as a threat to the validity of cancer molecular-marker research. <em>Nat Rev Cancer</em> <strong>5</strong>, 142&#8211;149 (2005). https://doi.org/10.1038/nrc1550 </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Fleming, T.R. and Powers, J.H. (2012) &#8216;Biomarkers and surrogate endpoints in clinical trials&#8217;, <em>Statistics in medicine</em>, 31(25), pp. 2973&#8211;2984. https://doi.org/10.1002/sim.5403.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Predictive (in)validity]]></title><description><![CDATA[Translational probabilities of success and selection bias]]></description><link>https://blog.neurostats.org/p/forecasting-translational-probabilities-of-success</link><guid isPermaLink="false">https://blog.neurostats.org/p/forecasting-translational-probabilities-of-success</guid><dc:creator><![CDATA[Manjari Narayan]]></dc:creator><pubDate>Mon, 12 May 2025 17:00:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/28c5c10e-88e6-44f4-a812-80eb401469af_1024x1536.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>It is common in late stage drug development for pharmaceutical companies to use rigorous quantitative decision making to plan out the different stages of clinical trials and how to evaluate the risk of clinical trial failures. This often goes under the area of <em>assurance</em> or <em>predictive probabilities of success</em>. </p><p>The original use-case for assurance was to <a href="https://pubmed.ncbi.nlm.nih.gov/23957523/">optimize sample size choices</a> in clinical trials! </p><blockquote><p><em><strong>Assurance</strong> is the unconditional probability that the trial will yield a &#8216;positive outcome&#8217;. A positive outcome usually means a statistically significant result, according to some standard frequentist significance test. The assurance is then the prior expectation of the power, averaged over the prior distribution for the unknown true treatment effect.</em></p><p><em>We argue that assurance is an important measure of the practical utility of a proposed trial, and indeed that it will often be appropriate to choose the size of the sample (and perhaps other aspects of the design) to achieve a desired assurance, rather than to achieve a desired power conditional on an assumed treatment effect.<br><br></em>O'Hagan, A., Stevens, J.W. and Campbell, M.J. (2005), Assurance in clinical trial design. Pharmaceut. Statist., 4: 187-201. <a href="https://doi.org/10.1002/pst.175">https://doi.org/10.1002/pst.175</a></p></blockquote><p><br>Most major pharmaceutical companies like <a href="https://ascpt.onlinelibrary.wiley.com/doi/10.1002/cpt.2488">Novartis</a>, <a href="https://pubmed.ncbi.nlm.nih.gov/36631827/">GSK</a>, and <a href="https://www.tandfonline.com/doi/full/10.1080/10543406.2014.972508">Roche</a> all have teams who design methodology for assurance, sometimes even for <a href="https://pubmed.ncbi.nlm.nih.gov/36631827/">clinical pharmacology and dosing</a> decisions, for trials based on <a href="https://onlinelibrary.wiley.com/doi/10.1002/sim.8060">surrogate endpoints</a>.  The quality of a <a href="https://www.tandfonline.com/doi/full/10.1080/10543406.2013.814377">biomarker </a> endpoint needs to be substantial to guarantee assurance. <br><br><a href="https://onlinelibrary.wiley.com/doi/full/10.1002/pst.2128">Conditional assurance</a> is a further extension of the original concept.</p><div class="pullquote"><p><br>Conditional assurance is the predicted assurance of a subsequent study, conditional on the success of an initial study and the design prior.<br></p></div><p><br>But what about analogous probabilistic forecasting for earlier stages of drug development? <a href="https://www.nature.com/articles/s41573-022-00552-x">Scannell et. al (2022)</a> point out that the lack of predictive validity of high throughput screening tools and translational models are in-part responsible for the inefficiency of biopharma R&amp;D. One might think that simply correlating the outcomes of a translational model with binary outcomes like drug approvals/failures would provide a good assessment of the predictive validity of translational models. But this is mistaken for a few conceptual reasons</p><ul><li><p>Clinical trial failures have a <strong>file-drawer problem</strong>, thus we don&#8217;t have an unbiased estimate of even historical successes/failures of some category of molecules-indication combinations. Forget forecasting, we don&#8217;t have an unbiased prediction error of historical drug development. One major but not only component of the file-drawer here is right censoring. </p></li><li><p><strong>Right censoring problem</strong>: The sequential nature of drug development implies a sequential drop off in the number of drugs tested from phase 1 &#8594; phase 2  &#8594; phase 3 trials. Thus we we don&#8217;t get to assess the future clinical validity of all assays of therapeutic potency, toxicity, DMPK, disease models, putative surrogate endpoints and so on without severe omission biases.</p></li><li><p><strong>Underspecification of success:</strong> Defining what constitutes success is a huge outcome and metric-hacking problem. There is a reason so much time and resources are dedicated to choosing endpoints in clinical trials, so that evaluations cannot be gamed. Similarly, choosing what constitutes success for pharmaceutical forecasting at every stage of drug development is subject to external pressures. </p></li></ul><p>One solution to resolving the omission bias problem is to move away from using binary measures of success such as drug approval/failure to using actual individual participant data from clinical trials which has much richer sources of variation to assess the clinical validity of  in-vitro/in-silico/in-vivo models and assays. This was the basis of the <a href="https://docs.google.com/document/d/1Npt134MFmtuKbi7Euld3FW8ORicrjtFf9vmNKPU4QKE/edit?usp=sharing">program idea</a> I developed last year during my time as a BRAINS Fellow with SpecTech, also on featured on <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Essential Technology&quot;,&quot;id&quot;:4039658,&quot;type&quot;:&quot;pub&quot;,&quot;url&quot;:&quot;https://open.substack.com/pub/convergentresearch&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65917eee-cde1-4770-8e14-26f3c3c73d18_1280x1280.png&quot;,&quot;uuid&quot;:&quot;8f9cc837-dc6e-4998-b7f0-896cc6e012cf&quot;}" data-component-name="MentionToDOM"></span>&#8217;s  <a href="https://www.gap-map.org/gaps/clinical-trials-are-poorly-optimized-for-evidence-gathering/">Gap Map database</a>. But there are even more important reasons to use individual patient-level data to evaluate translational forecasts &#8212;&nbsp;to reduce the <a href="https://onlinelibrary.wiley.com/doi/abs/10.1111/biom.12071">risk of surrogate paradoxes</a> and other clinical trial failures that amount to getting the therapeutic benefit-adverse effect tradeoff wrong.</p><p>In principle, a translational analogs of clinical assurance for every tool and model used for drug development would be the principled approach to keeping track of their effectiveness. AI in drug development is not merely about evaluating therapeutic candidates during hit and lead optimization, but also useful to severely test the translational and clinical validity of drug development tools  &#8212;&nbsp;all in-vitro, in-silico, in-vivo tools individually and jointly for their capacity to generate clinically valid forecasts. Even without raw preclinical and clinical trial datasets, however, there are a multitude of clever approaches to mitigating the biases from the sequential nature of clinical development. </p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.neurostats.org/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribe to hear more about improving translational success</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>  </p>]]></content:encoded></item></channel></rss>