Articles by Keith Goldfeld

A Bayesian proportional hazards model with a penalized spline

March 3, 2025 | Keith Goldfeld

In my previous post, I outlined a Bayesian approach to proportional hazards modeling. This post serves as an addendum, providing code to incorporate a spline to model a time-varying hazard ratio nonlinearly. In a second addendum to come I will pres...

[Read more...]

Estimating a Bayesian proportional hazards model

February 10, 2025 | Keith Goldfeld

A recent conversation with a colleague about a large stepped-wedge cluster randomized trial (SW-CRT) piqued my interest because the primary outcome is time-to-event. This is not something I’ve seen before. A quick dive into the literature su...

[Read more...]

Thinking about covariates in an analysis of an RCT

January 27, 2025 | Keith Goldfeld

I was recently discussing the analytic plan for a randomized controlled trial (RCT) with a clinical collaborator when she asked whether it’s appropriate to adjust for pre-specified baseline covariates. This question is so interesting because it touc...

[Read more...]

Can ChatGPT help construct non-trivial statistical models? An example with Bayesian “random” splines

October 7, 2024 | Keith Goldfeld

I’ve been curious to see how helpful ChatGPT can be for implementing relatively complicated models in R. About two years ago, I described a model for estimating a treatment effect in a cluster-randomized stepped wedge trial. We used a generalized ad...

[Read more...]

An IV study design to estimate an effect size when randomization is not ethical

September 2, 2024 | Keith Goldfeld

An investigator I frequently consult with seeks to estimate the effect of a palliative care treatment protocol for patients nearing end-stage disease, compared to a more standard, though potentially overly burdensome, therapeutic approach. Ideally, ...

[Read more...]

Generating binary data by specifying the relative risk, with simulations

July 1, 2024 | Keith Goldfeld

The most traditional approach for analyzing binary outcome data is logistic regression, where the estimated parameters are interpreted as log odds ratios or, if exponentiated, as odds ratios (ORs). No one other than statisticians (and maybe not even statisticians) finds the odds ratio to be a very intuitive statistic, and ...

[Read more...]

simstudy: another way to generate data from a non-standard density

June 3, 2024 | Keith Goldfeld

One of my goals for the simstudy package is to make it as easy as possible to generate data from a wide range of data distributions. The recent update created the possibility of generating data from a customized distribution specified in a user-defi...

[Read more...]

simstudy 0.8.0: customized distributions

May 20, 2024 | Keith Goldfeld

Over the past few years, a number of folks have asked if simstudy accommodates customized distributions. There’s been interest in truncated, zero-inflated, or even more standard distributions that haven’t been implemented in simstudy. While I’ve com...

[Read more...]

simstudy enhancement: specifying idiosyncratic follow-up times for longitudinal data

April 15, 2024 | Keith Goldfeld

A researcher reached out to me a few weeks ago. They were trying to generate longitudinal data that included irregularly spaced follow-up periods. The default periods generated by the function addPeriods in the simstudy package are \(\{0, 1, 2, ...,...

[Read more...]

Perfectly balanced treatment arm distribution in a multifactorial CRT using stratified randomization

February 19, 2024 | Keith Goldfeld

Over two years ago, I wrote a series of posts (starting here) that described possible analytic approaches for a proposed cluster-randomized trial with a factorial design. That proposal was recently funded by NIA/NIH, and now the Emergency department...

[Read more...]

A three-arm trial using two-step randomization

December 18, 2023 | Keith Goldfeld

Clinical Decision Support (CDS) tools are systems created to support clinical decision-making. Health care professionals using these tools can get guidance about diagnostic and treatment options when providing care to a patient. I’m currently involved with designing a trial focused on comparing a standard CDS tool with an enhanced ...

[Read more...]

Creating a nice looking Table 1 with standardized mean differences

September 25, 2023 | Keith Goldfeld

I’m in the middle of a perfect storm, winding down three randomized clinical trials (RCTs), with patient recruitment long finished and data collection all wrapped up. This means a lot of data analysis, presentation prep, and paper writing (and not so much blogging). One common (and not so glamorous) ...

[Read more...]

Finding logistic models to generate data with desired risk ratio, risk difference and AUC profiles

June 19, 2023 | Keith Goldfeld

About two years ago, someone inquired whether simstudy had the functionality to generate data from a logistic model with a specific AUC. It did not, but now it does, thanks to a paper by Peter Austin that describes a nice algorithm to accomplish thi...

[Read more...]

A demo of power estimation by simulation for a cluster randomized trial with a time-to-event outcome

May 22, 2023 | Keith Goldfeld

A colleague reached out for help designing a cluster randomized trial to evaluate a clinical decision support tool for primary care physicians (PCPs), which aims to improve care for high-risk patients. The outcome will be a time-to-event mea...

[Read more...]

Generating variable cluster sizes to assess power in cluster randomized trials

April 17, 2023 | Keith Goldfeld

In recent discussions with a number of collaborators at the NIA IMPACT Collaboratory about setting the sample size for a proposed cluster randomized trial, the question of variable cluster sizes has come up a number of times. Given a fixed overall s...

[Read more...]

Implementing a one-step GEE algorithm for very large cluster sizes in R

March 20, 2023 | Keith Goldfeld

Very large data sets can present estimation problems for some statistical models, particularly ones that cannot avoid matrix inversion. For example, generalized estimating equations (GEE) models that are used when individual observations are correla...

[Read more...]

simstudy 0.6.0 released: more flexible correlation patterns

February 20, 2023 | Keith Goldfeld

The new version (0.6.0) of simstudy is available for download from CRAN. In addition to some important bug fixes, I’ve added new functionality that should make data generation with correlated data a little more flexible. In the previous post, I desc...

[Read more...]

Flexible correlation generation: an update to genCorMat in simstudy

February 13, 2023 | Keith Goldfeld

I’ve been slowly working on some updates to simstudy, focusing mostly on the functionality to generate correlation matrices (which can be used to simulate correlated data). Here, I’m briefly describing the function genCorMat, which has been updated ...

[Read more...]

A GAM for time trends in a stepped-wedge trial with a binary outcome

January 16, 2023 | Keith Goldfeld

In a previous post, I described some ways one might go about analyzing data from a stepped-wedge, cluster-randomized trial using a generalized additive model (a GAM), focusing on continuous outcomes. I have spent the past few weeks developing a simi...

[Read more...]

Modeling the secular trend in a stepped-wedge design

December 12, 2022 | Keith Goldfeld

Recently I started a discussion about modeling secular trends using flexible models in the context of cluster randomized trials. I’ve been motivated by a trial I am involved with that is using a stepped-wedge study design. The initial post focused o...

[Read more...]

R-bloggers

R news and tutorials contributed by hundreds of R bloggers
