Partial update to results, fixing appencicies.

1 year ago · fff56b52ea
parent fb644c6c5d
commit fff56b52ea
8 changed files with 333 additions and 424 deletions
--- a/Paper/Main.tex
+++ b/Paper/Main.tex
@ -94,15 +94,19 @@ completion of clinical trials\\ \small{Preliminary Draft}}
 \printbibliography
 \newpage
 \appendix
 %---------------------------------------------------------------
-\section{Appendicies}
+\section{Diagnostics}\label{Appendix:Diagnostics}
 %---------------------------------------------------------------
 \subfile{sections/21_appendix_diagnostics}
 %---------------------------------------------------------------
 \section{Other Statistical Results}\label{Appendix:Results}
 %---------------------------------------------------------------
 \subfile{sections/22_appendix_full_results}
 \newpage
 \tableofcontents
 \end{document}
 % NOTES: 
 % 
 % 
--- a/Paper/sections/06_Results.tex
+++ b/Paper/sections/06_Results.tex
@ -71,33 +71,6 @@ not represented at all.
    \label{FIG:barchart_idc_categories}
 \end{figure}
 % Estimation Procedure
 I fit the econometric model using mc-stan 
 \cite{standevelopmentteam_StanModelling_2022}
 through the rstan 
 \cite{standevelopmentteam_RStanInterface_2023}
 interface using 4 chains with 
 %describe  
 2,500
 warmup iterations and
 2,500
 sampling iterations each.
 Two of the chains experienced a low 
 Estimated Baysian Fraction of Missing Information (E-BFMI) ,
 suggesting that there are some parts of the posterior distribution
 that were not explored well during the model fitting. 
 I presume this is due to the low number of trials in some of the 
 ICD-10 categories.
 We can see in Figure \ref{fig:barchart_idc_categories} that some of these 
 disease categories had a single trial represented while others were 
 not represented at all.
 \begin{figure}[H]
    \includegraphics[width=\textwidth]{../assets/img/trials_details/CategoryCounts}
    \caption{Bar chart of trials by ICD-10 categories}
    \label{fig:barchart_idc_categories}
 \end{figure}
 \subsection{Primary Results}
@ -111,7 +84,6 @@ keeping enrollment open.
 \begin{figure}[H]
    \includegraphics[width=\textwidth]{../assets/img/dist_diff_analysis/p_delay_intervention_distdiff_boxplot}
    \todo{Replace this graphic with the histdiff with boxplot}
    \small{
        Values near 1 indicate a near perfect increase in the probability 
        of termination. 
@ -128,16 +100,14 @@ keeping enrollment open.
 There are a few interesting things to point out here. 
 Let's start by getting aquainted with the details of the distribution above.
 It can be devided into a few different regimes.
 % - spike at 0
 % - the boxplot
 % - 63% of mass below 0 : find better way to say that
 %   - For a random trial, there is a 63% chance that the impact is to reduce the probability of a termination.
 % - 2 pctg-point wide band centered on 0 has ~13% of the masss
 % - mean represents 9.x% increase in probability of termination. A quick simulation gives about the same pctg-point increase in terminated trials.
 A few interesting interpretation bits come out of this.
 % - there are 3 regimes: low impact (near zero), medium impact (concentrated in decreased probability of termination), and high impact (concentrated in increased probability of termination). 
 The first this that there appear to be three different regimes. 
 The first regime consists of the low impact results, i.e. those values of $\delta_p$ 
 near zero. 
 About 13\% of trials lie within a single percentage point change of zero, 
@ -155,71 +125,57 @@ from a case where they were highly likely to complete their primary objectives t
 a case where they were likely or almost certain to terminate the trial early.
 %   - the high impact regime is strange because it consists of trials that moved from unlikely (<20% chance) of termination to a high chance (>80% chance) of termination. Something like 5% of all trials have a greater than 98 percentage point increase in termination. Not sure what this is doing. 
-%   - Potential Explanations for high impact regime:
+Based on the boxplot below, there are a couple of things to note.
-How could this intervention have such a wide range in the intensity 
+First, the median effect is a 2.3 percentage point decrease 
-and direction of impacts?
+in the probability of termination.
-A few explanations include that some trials are suceptable or that this is a 
+Second, for a random selction from our trials, 
-result of too little data.
+there is a 63\% chance that the impact is to 
-%       - Some trials are highly suceptable. This is the face value effect
+reduce the probability of a termination.
-One option is that some categories are more suceptable to 
+Third, about 13\% of the probability mass is contained within the interval 
-issues with participant enrollment. 
+[-0.1,0.1].
-If this is the case, we should be able to isolate categories that contribute
+Finally, the mean effect is measured as a 9.6 percentage point increase in 
-the most to this effect.
+the probability of termination.
-Another is that this might be a modelling artefact, due to the relatively
+The full percentile table can be found in 
-low number of trials in certain ICD-10 categories. 
+\ref{TABLE:PercentilesOfDistributionOfDifferences}
-In short, there might be high levels of uncertanty in some parameter values,
+in appendix
-which manifest as fat tails in the distributions of the $\beta$ parameters. 
+\ref{Appendix:Results}
-Because of the logistic format of the model, these fat tails lead to 
+
-extreme values of $p$, and potentally large changes $\delta_p$. 
+% Looking at the spike around zero, we find that 13.09% of the probability mass 
-%       - Could be uncertanty. If the model is highly uncertain, e.g. there isn't enough data, we could have a small percentage of large increases. This could be in general or just for a few categories with low amounts of data.
+% is contained within the band from [-1,1]. 
-% - 
+% Additionally, there was 33.4282738% of the probability above that 
-% - 
+% – representing those with a general increase in the 
-
+% probability of termination – and 53.4817262% of the probability mass 
-I believe that this second explanation -- a model artifact due to uncertanty --
+% below the band – representing a decrease in the probability of termination. 
-is likely to be the cause. 
+% On average, if you keep the trial open instead of closing it, 0.6337363% of
-Three points lead me to believe this:
+% trials will see a decrease in the probability of termination, but, due to 
-\begin{itemize}
+% the high increase in probability of termination given termination was 
-    \item The low fractions of E-BFMI suggest that the sampler is struggling 
+% increased, the mean probability of termination increases by 0.0964726. 
-        to explore some regions of the posterior. 
+
-        According to \cite{standevelopmentteam_RuntimeWarnings_2022} this is 
+
-        often due to thick tails of posterior distributions.
+% Pulled the data from the report
-    \item When we examine the results across different ICD-10 groups, 
+% ```{r}
-        \ref{fig:pred_dist_dif_delay2}
+% summary(pddf_ib$value) 
-        \todo{move figure from below}
+% Min.      1st Qu.   Median   Mean    3rd Qu. Max. 
-        we note this same issue.
+% -0.99850 -0.12919 -0.02259 0.09647 0.14531 1.00000 
-    \item In Figure \ref{fig:betas_delay}, we see that some some ICD-10 categories
+% quants <- quantile(pddf_ib$value, probs = seq(0,1,0.05), type=4) 
-        \todo{add figure}
+%    # Convert to a data frame 
-        have \todo{note fat tails}.
+%    quant_df <- data.frame( Percentile = names(quants),  Value = quants ) 
-    \item There are few trials available, particularly among some specific 
+%    kable(quant_df)
-        ICD-10 categories.
+%    Percentile Value 
-\end{itemize}
+%   SEE TABLE IN APPENDIX
-%           - take a look at beta values and then discuss if that lines up with results from dist-diff by group. 
+%```
-%       - My initial thought is that there is not enough data/too uncertain. I think this because it happens for most/all of the categories.
+
-% - 
+Figure \ref{fig:pred_dist_dif_delay2} shows how the different disease categories
-% - 
+tend to have a similar results:
 % - 
 Overally it is hard to escape the conclusion that more data is needed across
 many -- if not all -- of the disease categories.
 Figure \ref{fig:pred_dist_dif_delay2} shows how this overall
 result comes from different disease categories.
 \begin{figure}[H]
    \includegraphics[width=\textwidth]{../assets/img/dist_diff_analysis/p_delay_intervention_distdiff_by_group}
    \caption{Distribution of Predicted differences by Disease Group}
    \label{fig:pred_dist_dif_delay2}
 \end{figure}
 Again, note the high mass near zero, the general decrease in the probability
 of termination, and then the strong upper tails.
-
+Continuing to the $\beta$ parameters, 
 \subsection{Secondary Results}
 % Examine beta parameters 
 % - Little movement except where data is strong, general negative movement. Still really wide 
 % - Note how they all learned (partial pooling) reduction in \beta from ANR?
 % - Need to discuss the 5 different states. Can't remember which one is dropped for the life of me. May need to fix parameterization.
 % - 
 \begin{figure}[H]
    \includegraphics[width=\textwidth]{../assets/img/betas/parameter_across_groups/parameters_12_status_ANR}
    \caption{Distribution of parameters associated with ``Active, not recruiting'' status, by ICD-10 Category}
@ -227,147 +183,66 @@ result comes from different disease categories.
 \end{figure}
 % - 
-\subsection{Primary Results}
+Finally, in figure \ref{fig:parameters_ANR_by_group}, we can see the estimated distributions of the $\beta$ parameter for
-
+the status: \textbf{Active, not recruiting}.
-The primary, causally-identified value we can estimate is the change in 
+The prior distributions were centered on zero, but we can see that the pooled learning has moved the mean
-the probability of termination caused by (counterfactually) keeping enrollment
+values negative, representing reductions in the probability of termination across the board. 
-open instead of closing enrollment when observed. 
+This decrease in the probability of termination is strongest in the categories of Neoplasms ($n=49$),
-In figure \ref{fig:pred_dist_diff_delay} below, we see this impact of 
+Musculoskeletal diseases ($n=17$), and Infections and Parasites ($n=20$), the three categories with the most data.
-keeping enrollment open.
+As this is a comparison against the trial status XXX, we note that
-
+\todo{The natural comparison I want to make is against the Recruting status. Do I want to redo this so that I can read that directly?It shouldn't affect the $\delta_p$ analysis, but this could probably use it. YES, THIS UPDATE NEEDS TO HAPPEN. The base needs to be ``active not recruiting.''}
-
+Overall, this suggests that extending a clinical trial's enrollment period will reduce the probability of termination.
 \begin{figure}[H]
    \includegraphics[width=\textwidth]{../assets/img/dist_diff_analysis/p_delay_intervention_distdiff_boxplot}
    \small{
        Values near 1 indicate a near perfect increase in the probability 
        of termination. 
        Values near 0 indicate little change in probability,
        while values near -1, represent a decrease in the probability
        of termination. 
        The scale is in probability points, thus a value near 1 is a change 
        from unlikely to terminate under control, to highly likely to 
        terminate.
    }
    \caption{Histogram of the Distribution of Predicted Differences}
    \label{fig:pred_dist_diff_delay}
 \end{figure}
 There are a few interesting things to point out here. 
 Let's start by getting aquainted with the details of the distribution above.
 % - spike at 0
 % - the boxplot
 % - 63% of mass below 0 : find better way to say that
 %   - For a random trial, there is a 63% chance that the impact is to reduce the probability of a termination.
 % - 2 pctg-point wide band centered on 0 has ~13% of the masss
 % - mean represents 9.x% increase in probability of termination. A quick simulation gives about the same pctg-point increase in terminated trials.
 A few interesting interpretation bits come out of this.
 % - there are 3 regimes: low impact (near zero), medium impact (concentrated in decreased probability of termination), and high impact (concentrated in increased probability of termination). 
 The first this that there appear to be three different regimes. 
 The first regime consists of the low impact results, i.e. those values of $\delta_p$ 
 near zero. 
 About 13\% of trials lie within a single percentage point change of zero, 
 suggesting that there is a reasonable chance that delaying 
 a close of enrollment has no impact. 
 The second regime consists of the moderate impact on clinical trials'
 probabilities of termination, say values in the interval $[-0.5, 0.5]$ 
 on the graph.
 Most of this probability mass is represents a decrease in the probability of 
 a termination, some of it rather large.
 Finally, there exists the high impact region, almost exclusively concentrated 
 around increases in the probability of termination at $\delta_p > 0.75$. 
 These represent cases where delaying the close of enrollemnt changes a trial
 from a case where they were highly likely to complete their primary objectives to 
 a case where they were likely or almost certain to terminate the trial early.
 %   - the high impact regime is strange because it consists of trials that moved from unlikely (<20% chance) of termination to a high chance (>80% chance) of termination. Something like 5% of all trials have a greater than 98 percentage point increase in termination. Not sure what this is doing. 
 %   - Potential Explanations for high impact regime:
-How could this intervention have such a wide range in the intensity 
+This leads to the question:
-and direction of impacts?
+``How could this intervention have such a wide range in the intensity 
-A few explanations include that some trials are suceptable or that this is a 
+and direction of impacts?''
-result of too little data.
+The most likely explanations in my mind are that either
 some trials are highly suceptable to enrollment struggles or that this is a 
 modelling artifact.
 %       - Some trials are highly suceptable. This is the face value effect
-One option is that some categories are more suceptable to 
+The first option -- that some categories are more suceptable to 
-issues with participant enrollment. 
+issues with participant enrollment -- should allow us to 
-If this is the case, we should be able to isolate categories that contribute
+isolate categories or trials that contribute the most to this effect.
-the most to this effect.
+In figure 
-Another is that this might be a modelling artefact, due to the relatively
+\ref{fig:pred_dist_dif_delay2}, it appears that most of the trials have
-low number of trials in certain ICD-10 categories. 
+this high impact regime at $\delta_p > 0.75$.
 Another explanation is that this is a modelling artefact due to priors 
 with strong tails and the relatively low number of trials in 
 each ICD-10 categories.
 In short, there might be high levels of uncertanty in some parameter values,
 which manifest as fat tails in the distributions of the $\beta$ parameters. 
 Because of the logistic format of the model, these fat tails lead to 
 extreme values of $p$, and potentally large changes $\delta_p$. 
 %       - Could be uncertanty. If the model is highly uncertain, e.g. there isn't enough data, we could have a small percentage of large increases. This could be in general or just for a few categories with low amounts of data.
 % - 
 % - 
 I believe that this second explanation -- a model artifact due to uncertanty --
 is likely to be the cause. 
-Three points lead me to believe this:
+A few things lead me to believe this:
 \begin{itemize}
    \item The low fractions of E-BFMI suggest that the sampler is struggling 
        to explore some regions of the posterior. 
-        According to 
+        According to \cite{standevelopmentteam_RuntimeWarnings_2022} this is 
        \authorcite{standevelopmentteam_runtimewarningsconvergence_2022}
        this is 
        often due to thick tails of posterior distributions. 
-    \item When we examine the results across different ICD-10 groups, 
+        During earlier analysis, when I had about 100 trials, the number of 
        warnings was significantly higher.
    \item When we examine the results across different ICD-10 category, 
        \ref{fig:pred_dist_dif_delay2}
-        we note this same issue.
+        we note that most categories have the same upper tail spike.
-    \item In Figure \ref{fig:parameters_ANR_by_group}, we see that some 
+    \item In Figure 
-        ICD-10 categories have 
+        % \ref{fig:betas_delay}, 
-        \todo{note fat tails}.
+        \ref{fig:parameters_ANR_by_group},
-    \item There are few trials available, particularly among some specific 
+        we see that most ICD-10 categories
-        ICD-10 categories.
+        have fat tails in the $\beta$s, even among the categories 
-        \todo{refer to figure ??}
+        relatively larger sample sizes.
 \end{itemize}
 \todo{Reformat so this refers to the original discussion of issues better.}
 %           - take a look at beta values and then discuss if that lines up with results from dist-diff by group. 
 %       - My initial thought is that there is not enough data/too uncertain. I think this because it happens for most/all of the categories.
 % - 
 % - 
 % - 
 We can examine the per-group distributions of differences in \ref{fig:pred_dist_dif_delay2} to 
 acertain that the high impact group does exist in each of the groups.
 This lends credence to the idea that this is a modelling issue, potentially
 due to the low amounts of data overall.
 Figure \ref{fig:pred_dist_dif_delay2} shows how this overall
 result comes from different disease categories.
 \begin{figure}
    \includegraphics[width=\textwidth]{../assets/img/dist_diff_analysis/p_delay_intervention_distdiff_by_group}
    \caption{Distribution of Predicted differences by Disease Group}
    \label{fig:pred_dist_dif_delay2}
 \end{figure}
-% Examine beta parameters 
+\end{itemize}
 % - Little movement except where data is strong, general negative movement. Still really wide 
 % - Note how they all learned (partial pooling) reduction in \beta from ANR?
 % - Need to discuss the 5 different states. Can't remember which one is dropped for the life of me. May need to fix parameterization.
 % - 
 Finally, in figure \ref{fig:parameters_ANR_by_group}, we can see the estimated distributions of the $\beta$ parameter for
 the status: \textbf{Active, not recruiting}.
 The prior distributions were centered on zero, but we can see that the pooled learning has moved the mean
 values negative, representing reductions in the probability of termination across the board. 
 This decrease in the probability of termination is strongest in the categories of Neoplasms ($n=$),
 Musculoskeletal diseases ($n=$), and Infections and Parasites ($n=$), the three categories with the most data.
 As this is a comparison against the trial status XXX, we note that
 \todo{The natural comparison I want to make is against the Recruting status. Do I want to redo this so that I can read that directly?It shouldn't affect the $\delta_p$ analysis, but this could probably use it.}
 Overall, this suggests that extending a clinical trial's enrollment period will reduce the probability of termination.
 \begin{figure}[H]
    \includegraphics[width=\textwidth]{../assets/img/betas/parameter_across_groups/parameters_12_status_ANR}
    \caption{Distribution of parameters associated with ``Active, not recruiting'' status, by ICD-10 Category}
    \label{fig:parameters_ANR_by_group}
 \end{figure}
 % - 
-Overall it is hard to escape the conclusion that more data is needed across
+Overally it is hard to escape the conclusion that more data is needed across
 many -- if not all -- of the disease categories.
 At the same time, the median result is a decrease in the probability 
 of termination when the enrollment period is held open.
 \end{document}
--- a/Paper/sections/11_intro_and_lit.tex
+++ b/Paper/sections/11_intro_and_lit.tex
@ -31,7 +31,8 @@ one form of operational failure
 in Phase III clinical trials. 
 Using a novel dataset constructed from administrative data registered on 
 ClinicalTrials.gov, I exploit variation in enrollment timing and market
-conditions to identify how extending the enrollment period affects trial completion. 
+conditions to identify how extending the enrollment period 
 affects trial completion. 
 Specifically, I answer the question:
 \textit{
    ``How does the probability of trial termination change 
@ -43,199 +44,18 @@ pipeline and progression between clinical trial phases.
 % In 1938 President Franklin D Rosevelt signed the Food, Drug, and Cosmetic Act,
 % granting the Food and Drug Administration (FDA) authority to require 
 % pre-market approval of pharmaceuticals. 
 % \cite{commissioner_milestonesusfood_2023}
 % As of Sept 2022 \todo{Check Date} they have approved 6,602 currently-marketed 
 % compounds with Structured Product Labels (SPLs) 
 % and 10,983 previously-marketed SPLs
 % \cite{commissioner_nsde_2024},
 % %from nsde table. Get number of unique application_nubmers_or_citations with most recent end date as null.
 % In 1999, they began requiring that drug developers register and 
 % publish clinical trials on \url{https://clinicaltrials.gov}.
 % This provides a public mechanism where clinical trial sponsors are 
 % responsible to explain what they are trying to acheive and how it will be 
 % measured, as well as provide the public the ability to search and find trials 
 % that they might enroll in.
 % Multiple derived datasets such as the Cortellis Investigational Drugs dataset 
 % or the AACT dataset from the Clinical Trials Transformation Intiative
 % integrate these data. 
 % This brings up a question: 
 % Can we use this public data on clinical trials to identify what effects the 
 % success or failure of trials?
 % In this work, I use updates to records on 
 % \url{https://ClinicalTrials.gov} 
 % to do exactly that, disentangle the effect of participant enrollment 
 % and competing drugs on the market affect the success or failure of 
 % clinical trials.
 \subsection{Background}
 %Describe how clinical trials fit into the drug development landscape and how they proceed
 Clinical trials are a required part of drug development.
 Not only does the FDA require that a series of clinical trials demonstrate sufficient safety and efficacy of
 a novel pharmaceutical compound or device, producers of derivative medicines may be required to ensure that
 their generic small molecule compound -- such as ibuprofen or levothyroxine -- matches the
 performance of the originator drug if delivery or dosage is changed.
 For large molecule generics (termed biosimilars) such as Adalimumab
 (Brand name Humira, with biosimilars Abrilada, Amjevita, Cyltezo, Hadlima, Hulio,
 Hyrimoz, Idacio, Simlandi, Yuflyma, and Yusimry),
 the biosimilars are required to prove they have similar efficacy and safety to the
 reference drug.
 %TODO? Decide whether to include this or not
 %When registering these clinical trials
 % discuss how these are registered and what data is published.
 % Include image and discuss stages
 % Discuss challenges faced
 % Introduce my work
 In the world of drug development, these trials are classified into different 
 phases of development\footnote{
 \cite{anderson_fdadrugapproval_2022}
 provide an overview of this process
 while
 \cite{commissioner_drugdevelopmentprocess_2020}
 describes the process in detail.}.
 Pre-clinical studies primarily establish toxicity and potential dosing levels.
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 Phase I trials are the first attempt to evaluate safety and efficacy in humans. 
 Participants typically are healthy individuals, and they measure how the drug 
 affects healthy bodies, potential side effects, and adjust dosing levels. 
 Sample sizes are often less than 100 participants. 
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 Phase II trials typically involve a few hundred participants and is where 
 investigators will dial in dosing, research methods, and safety.
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 A Phase III trial is the final trial before approval by the FDA, and is where 
 the investigator must demonstrate safety and efficacy with a large number of 
 participants, usually on the order of hundreds or thousands.
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 Occasionally, a trial will be a multi-phase trial, covering aspects of either
 Phases I and II or Phases II and III. 
 After a successful Phase III trial, the sponsor will decide whether or not 
 to submit an application for approval from the FDA. 
 Before filing this application, the developer must have completed 
 ``two large, controlled clinical trials.''
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 Phase IV trials are used after the drug has received marketing approval to 
 validate safety and efficacy in the general populace.
 Throughout this whole process, the FDA is available to assist in decision-making
 regarding topics such as study design, document review, and whether
 they should terminate the trial. 
 The FDA also reserves the right to place a hold on the clinical trial for 
 safety or other operational concerns, although this is rare. 
 \cite{commissioner_drugdevelopmentprocess_2020}.
 In the economics literature, most of the focus has been on describing how 
 drug candidates transition between different phases and their probability 
 of final approval.
 % Lead into lit review
 % Abrantes-Metz, Adams, Metz (2004)
 \authorcite{abrantes-metz_pharmaceuticaldevelopmentphases_2004}
 described the relationship between
 various drug characteristics and how the drug progressed through clinical trials.
 % This descriptive estimate was notable for using a 
 % mixed state proportional hazard model and estimating the impact of 
 % observed characteristics in each of the three phases.
 They found that as Phase I and II trials last longer, 
 the rate of failure increases. 
 In contrast, Phase 3 trials generally have a higher rate of 
 success than failure after 91 months.
 This may be due to the fact that the purpose of Phases I and II are different
 from the purpose of Phase III.
 Continuing on this theme,
 %DiMasi FeldmanSeckler Wilson 2009
 \authorcite{dimasi_trendsrisksassociated_2010}
 examine the completion rate of clinical drug 
 development and find that for the 50 largest drug producers, 
 approximately 19\% of their drugs under development between 1993 and 2004
 successfully moved from Phase I to receiving an New Drug Application (NDA) 
 or Biologics License Application (BLA). 
 They note a couple of changes in how drugs are developed over the years they 
 study, most notably that
 drugs began to fail earlier in their development cycle in the 
 latter half of the time they studied. 
 They note that this may reduce the cost of new drugs by eliminating late 
 and costly failures in the development pipeline.
 Earlier work by 
 \authorcite{dimasi_valueimprovingproductivity_2002}
 used data on 68 investigational drugs from 10 firms to simulate how reducing
 time in development reduces the costs of developing drugs. 
 He estimates that reducing Phase III of clinical trials by one year would 
 reduce total costs by about 8.9\% and that moving 5\% of clinical trial failures
 from phase III to Phase II would reduce out of pocket costs by 5.6\%. 
 A key contribution to this drug development literature is the work by 
 \authorcite{khmelnitskaya_competitionattritiondrug_2021}
 who created a causal identification strategy
 to disentangle strategic exits from exits due to clinical failures 
 in the drug development pipeline.
 She found that overall 8.4\% of all pipeline exits are due to strategic 
 terminations and that the rate of new drug production would be about 23\% 
 higher if those strategic terminatations were eliminated.
 The work that is closest to mine is the work by 
 \authorcite{hwang_failureinvestigationaldrugs_2016}
 who investigated causes for which late stage (Phase III)
 clinical trials fail -- with a focus on trials in the USA, 
 Europe, Japan, Canada, and Australia. 
 They identified 640 novel therapies and then studied each therapy's 
 development history, as outlined in commercial datasets.
 They found that for late stage trials that did not go on to receive approval,
 57\% failed on efficacy grounds, 17\% failed on safety grounds, and 22\% failed
 on commercial or other grounds.
 Unfortunately the work of both 
 \authorcite{hwang_failureinvestigationaldrugs_2016}
 and
 \authorcite{khmelnitskaya_competitionattritiondrug_2021}
 ignore a potentially large cause of failures: operational challenges, i.e. when
 issues running or funding the trial cause it to fail before achieving its 
 primary objective.
 In a personal review of 199 randomly selected clinical trials which terminated
 before achieving their primary objective,
 I found that 
 14.5\% cited safety or efficacy concerns, 
 9.1\% cited funding problems (an operational concern),
 and 
 31\% cited enrollment issues (a separate operational concern)\footnote{
 Note that these figures differ from 
 \authorcite{hwang_failureinvestigationaldrugs_2016}
 because I sampled from all stages of trials, not just Phase III trials
 focused on drug development.
 }.
 The main contribution of this work is the model I develop to separate 
 the causal effects of 
 market conditions (a strategic concern) from the effects of 
 participant enrollment (an operational concern) on Phase III Clinical trials. 
 This allows me to answer the question posed earlier:
 \textit{
    ``How does the probability of trial termination change 
    when the enrollment period is extended?''
 }
 using administrative data.
 To understand how I do this, we'll cover some background information on 
-clinical trials and the administrative data I collected in section 
+clinical trials, the current literature, 
-\ref{SEC:ClinicalTrials}, 
+and the administrative data I collected in section 
-explain the approach to causal identification, the required data,
+\ref{SEC:ClinicalTrials}.
-and describe how the data used matches these requirements in section 
+Then I'll
 explain the approach to causal identification and how the data collected
 matches those results,
 \ref{SEC:CausalAndData}. 
 Then we'll cover the econometric model 
 (section \ref{SEC:EconometricModel}) 
-and results (section 
+and results (section \ref{SEC:Results}). 
 \ref{SEC:Results}). 
 Finally, we acknowledge deficiencies in the analysis and potential improvements
 in section 
 \ref{SEC:Improvements},
--- a/Paper/sections/12_clinical_trial_background.tex
+++ b/Paper/sections/12_clinical_trial_background.tex
@ -92,6 +92,7 @@ or termination.
 Termination occurs after enrollment has begun but before achieving the 
 primary objective.
 Understanding why trials terminate early is the key goal of this work, but
 is not straightforward.
 Terminated trials typically record a 
@ -109,7 +110,8 @@ led to the termination, leaving us to
 use another way to infer the relative impact of operational difficulties.
-To better descrobe termination causes, I suggest classifying them into 
+\todo{move the following}
 To better describe termination causes, I suggest classifying them into 
 three broad categories. 
 The first category, Safety or Efficacy concerns, occurs when data suggests 
 the treatment is unsafe or unlikely to achieve its therapeutic goals. 
@ -127,7 +129,152 @@ These latter two categories represent true failures of the trial process,
 as they prevent us from learning whether the treatment would have 
 been safe and effective.
-\subsection{Data Summary}
+
 \subsection{Literature on Clinical Trials}\label{SEC:LitReview}
 %Describe how clinical trials fit into the drug development landscape and how they proceed
 Clinical trials are a required part of drug development.
 Not only does the FDA require that a series of clinical trials demonstrate sufficient safety and efficacy of
 a novel pharmaceutical compound or device, producers of derivative medicines may be required to ensure that
 their generic small molecule compound -- such as ibuprofen or levothyroxine -- matches the
 performance of the originator drug if delivery or dosage is changed.
 For large molecule generics (termed biosimilars) such as Adalimumab
 (Brand name Humira, with biosimilars Abrilada, Amjevita, Cyltezo, Hadlima, Hulio,
 Hyrimoz, Idacio, Simlandi, Yuflyma, and Yusimry),
 the biosimilars are required to prove they have similar efficacy and safety to the
 reference drug.
 In the world of drug development, these trials are classified into different 
 phases of development\footnote{
 \cite{anderson_fdadrugapproval_2022}
 provide an overview of this process
 while
 \cite{commissioner_drugdevelopmentprocess_2020}
 describes the process in detail.}.
 Pre-clinical studies primarily establish toxicity and potential dosing levels.
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 Phase I trials are the first attempt to evaluate safety and efficacy in humans. 
 Participants typically are healthy individuals, and they measure how the drug 
 affects healthy bodies, potential side effects, and adjust dosing levels. 
 Sample sizes are often less than 100 participants. 
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 Phase II trials typically involve a few hundred participants and is where 
 investigators will dial in dosing, research methods, and safety.
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 A Phase III trial is the final trial before approval by the FDA, and is where 
 the investigator must demonstrate safety and efficacy with a large number of 
 participants, usually on the order of hundreds or thousands.
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 Occasionally, a trial will be a multi-phase trial, covering aspects of either
 Phases I and II or Phases II and III. 
 After a successful Phase III trial, the sponsor will decide whether or not 
 to submit an application for approval from the FDA. 
 Before filing this application, the developer must have completed 
 ``two large, controlled clinical trials.''
 % \cite{commissioner_drugdevelopmentprocess_2020}.
 Phase IV trials are used after the drug has received marketing approval to 
 validate safety and efficacy in the general populace.
 Throughout this whole process, the FDA is available to assist in decision-making
 regarding topics such as study design, document review, and whether
 they should terminate the trial. 
 The FDA also reserves the right to place a hold on the clinical trial for 
 safety or other operational concerns, although this is rare. 
 \cite{commissioner_drugdevelopmentprocess_2020}.
 In the economics literature, most of the focus has been on describing how 
 drug candidates transition between different phases and their probability 
 of final approval.
 % Lead into lit review
 % Abrantes-Metz, Adams, Metz (2004)
 \authorcite{abrantes-metz_pharmaceuticaldevelopmentphases_2004}
 described the relationship between
 various drug characteristics and how the drug progressed through clinical trials.
 % This descriptive estimate was notable for using a 
 % mixed state proportional hazard model and estimating the impact of 
 % observed characteristics in each of the three phases.
 They found that as Phase I and II trials last longer, 
 the rate of failure increases. 
 In contrast, Phase 3 trials generally have a higher rate of 
 success than failure after 91 months.
 This may be due to the fact that the purpose of Phases I and II are different
 from the purpose of Phase III.
 Continuing on this theme,
 %DiMasi FeldmanSeckler Wilson 2009
 \authorcite{dimasi_trendsrisksassociated_2010}
 examine the completion rate of clinical drug 
 development and find that for the 50 largest drug producers, 
 approximately 19\% of their drugs under development between 1993 and 2004
 successfully moved from Phase I to receiving an New Drug Application (NDA) 
 or Biologics License Application (BLA). 
 They note a couple of changes in how drugs are developed over the years they 
 study, most notably that
 drugs began to fail earlier in their development cycle in the 
 latter half of the time they studied. 
 They note that this may reduce the cost of new drugs by eliminating late 
 and costly failures in the development pipeline.
 Earlier work by 
 \authorcite{dimasi_valueimprovingproductivity_2002}
 used data on 68 investigational drugs from 10 firms to simulate how reducing
 time in development reduces the costs of developing drugs. 
 He estimates that reducing Phase III of clinical trials by one year would 
 reduce total costs by about 8.9\% and that moving 5\% of clinical trial failures
 from phase III to Phase II would reduce out of pocket costs by 5.6\%. 
 A key contribution to this drug development literature is the work by 
 \authorcite{khmelnitskaya_competitionattritiondrug_2021}
 who created a causal identification strategy
 to disentangle strategic exits from exits due to clinical failures 
 in the drug development pipeline.
 She found that overall 8.4\% of all pipeline exits are due to strategic 
 terminations and that the rate of new drug production would be about 23\% 
 higher if those strategic terminatations were eliminated.
 The work that is closest to mine is the work by 
 \authorcite{hwang_failureinvestigationaldrugs_2016}
 who investigated causes for which late stage (Phase III)
 clinical trials fail -- with a focus on trials in the USA, 
 Europe, Japan, Canada, and Australia. 
 They identified 640 novel therapies and then studied each therapy's 
 development history, as outlined in commercial datasets.
 They found that for late stage trials that did not go on to receive approval,
 57\% failed on efficacy grounds, 17\% failed on safety grounds, and 22\% failed
 on commercial or other grounds.
 Unfortunately the work of both 
 \authorcite{hwang_failureinvestigationaldrugs_2016}
 and
 \authorcite{khmelnitskaya_competitionattritiondrug_2021}
 ignore a potentially large cause of failures: operational challenges, i.e. when
 issues running or funding the trial cause it to fail before achieving its 
 primary objective.
 In a personal review of 199 randomly selected clinical trials which terminated
 before achieving their primary objective,
 I found that 
 14.5\% cited safety or efficacy concerns, 
 9.1\% cited funding problems (an operational concern),
 and 
 31\% cited enrollment issues (a separate operational concern)\footnote{
 Note that these figures differ from 
 \authorcite{hwang_failureinvestigationaldrugs_2016}
 because I sampled from all stages of trials, not just Phase III trials
 focused on drug development.
 }.
 \subsection{Introduction to \href{https://ClinicalTrials.gov}{ClinicalTrials.Gov}}
 %% Describe data here
 Since Sep 27th, 2007 those who conduct clinical trials of FDA controlled 
 drugs or devices on human subjects must register 
@ -176,13 +323,18 @@ information about the past state of trials.
 I combined these two sources, using the AACT dataset to select 
 trials of interest and then scraping \url{ClinicalTrials.gov} to get 
 a timeline of each trial.
 The result is a series of snapshots, each documenting a specific set of 
 recorded changes in a trial. 
 It is these snapshots that provide the opportunity to estimate the 
 data generating process corresponding to the clinical trials for 
 which I have data.
 %%%%%%%%%%%%%%%%%%%%%%%% Model Outline
-The way I use this data is to predict the final status of the trial 
+% The way I use this data is to predict the final status of the trial 
-from the snapshots that were taken, in effect asking:
+% from the snapshots that were taken, in effect asking:
-``how does the probability of a termination change from the current state 
+% ``how does the probability of a termination change from the current state 
-of the trial if X changes?''
+% of the trial if X changes?''
 % - 
 % - 
 % - 
--- a/Paper/sections/22_appendix_full_results.tex
+++ b/Paper/sections/22_appendix_full_results.tex
@ -0,0 +1,39 @@
 \documentclass[../Main.tex]{subfiles}
 \graphicspath{{\subfix{Assets/img/}}}
 \begin{document}
 \begin{center}
    \label{TABLE:PercentilesOfDistributionOfDifferences}
    % \caption{Table of Percentiles of Distribution of Differences}
    \begin{tabular}{cc}
        \hline
        Percentile & Value \\
        \hline
        0\% & -0.9985020 \\
        5\% & -0.3763454 \\
        10\% & -0.2639654 \\
        15\% & -0.2053399 \\
        20\% & -0.1628793 \\
        25\% & -0.1291890 \\
        30\% & -0.0980523 \\
        35\% & -0.0734082 \\
        40\% & -0.0547123 \\
        45\% & -0.0385514 \\
        50\% & -0.0225949 \\
        55\% & -0.0045955 \\
        60\% & -0.0000394 \\
        65\% & 0.0010549 \\
        70\% & 0.0509626 \\
        75\% & 0.1453046 \\
        80\% & 0.3425234 \\
        85\% & 0.7084837 \\
        90\% & 0.9250351 \\
        95\% & 0.9820456 \\
        100\% & 1.0000000 \\
        \hline
    \end{tabular}
 \end{center}
 \end{document}
--- a/assets/preambles/References.bib
+++ b/assets/preambles/References.bib
@ -5355,7 +5355,7 @@ California 90401-3208},
  file = {/home/will/Zotero/storage/KAHW2ABD/Indexing-SPL-Fact-Sheet.pdf}
 }
-@online{usnlm_fdaaa800finalrule,
+@online{usnlm_fdaaa801finalrule,
  type = {Government},
  title = {{{FDAAA}} 801 and the {{Final Rule}} - {{ClinicalTrials}}.Gov},
  author = {{U.S. National Library of Medicine}},
--- a/logs.org
+++ b/logs.org
@ -5,3 +5,11 @@
    Need to decide whether or not to include this set of sentences.
 **** [2025-01-18 Sat 11:58] [[[[file:/home/will/research/phd_deliverables/JobMarketPaper/Paper/sections/11_intro_and_lit.tex::45]]]] 
    decide whether to include these details here
 ** 2025-W05
 *** 2025-01-29 Wednesday
 **** [2025-01-29 Wed 10:12] Summary of yesterday, thoughts for today
     Yesterday I got my draft mostly done. I rearranged the causal inference section
     fixed some references, etc. 
     Today I want to remove a bunch of todos, read it backwards to fix things,
     and get it sent to Tom.
     I'll also run it by claude.ai.
--- a/todo.org
+++ b/todo.org
@ -4,19 +4,19 @@
 **** DONE Push work to overleaf
     DEADLINE: <2025-01-15 Wed> CLOSED: [2025-01-20 Mon 11:46]
 *** 2025-01-17 Friday
-**** TODO Redo analysis using "Recruitng" as the base status
+**** DONE Fix JMP based on Tom's Suggestions and send to committee
-     
+     CLOSED: [2025-01-29 Wed 10:11]
-     The goal is to get the $\beta$'s for active, not recruitng.
+***** DONE Get references working properly 
-**** TODO Fix JMP based on Tom's Suggestions and send to committee
+      CLOSED: [2025-01-29 Wed 09:58]
 ***** TODO Get references working properly 
      - setup author date format
      - fix references, add to Overleaf version
-***** TODO Read Backward
+***** DONE fix issues
-      Identify poorly written portions (incomplete sentences and paragraphs) and what I was trying to communicate.
+      CLOSED: [2025-01-29 Wed 09:58]
 ***** TODO fix issues
 *** 2025-01-18 Saturday
-**** TODO Decide if this section needs added   
+**** DONE Decide if this section needs added   
     CLOSED: [2025-01-29 Wed 09:58]
    [[[[file:/home/will/research/phd_deliverables/JobMarketPaper/Paper/sections/11_intro_and_lit.tex::45]]]] 
     nope
 **** RECINDED Update citations in lit review section.  
     CLOSED: [2025-01-20 Mon 11:47]
    [[[[file:/home/will/research/phd_deliverables/JobMarketPaper/Paper/sections/05_LitReview.tex::25]]]] 
@ -31,8 +31,19 @@
     Realized that this was readded by mistake. I integrated lit review into intro in 11
 ** 2025-W04
 *** 2025-01-20 Monday
-**** TODO get a citation for the AACT project  
+**** DONE get a citation for the AACT project  
     CLOSED: [2025-01-29 Wed 09:55]
    [[[[file:/home/will/research/phd_deliverables/JobMarketPaper/Paper/sections/10_CausalStory.tex::114]]]] 
 *** 2025-01-23 Thursday
-**** TODO Pickup citation fixes here  
+**** DONE Pickup citation fixes here  
     CLOSED: [2025-01-29 Wed 09:55]
    [[[[file:/home/will/research/phd_deliverables/JobMarketPaper/Paper/sections/06_Results.tex::174]]]] 
 ** 2025-W05
 *** 2025-01-29 Wednesday
 **** TODO Review JMP, list areas that need rewritten.
 ***** TODO Read Backward
      Identify poorly written portions (incomplete sentences and paragraphs) and what I was trying to communicate.
 **** TODO Redo analysis using "Recruitng" as the base status
     The goal is to get the $\beta$'s for active, not recruitng.