Skip to main content

Questions tagged [regression]

Techniques for analyzing the relationship between one (or more) "dependent" variables and "independent" variables.

Filter by
Sorted by
Tagged with
2 votes
1 answer
69 views

I’m trying to use the R poly() function with degree 1 to force glm to interpret a factor linearly. I’m puzzled by the fact that the size of the sample seems to increase the coefficient of the ...
Guillaume's user avatar
1 vote
0 answers
47 views

I am going through the creation of a prediction interval for a value drawn from the conditional distribution of $Y$ given $X=x$ under simple linear regression as shown in the image above. The ...
froot's user avatar
  • 83
0 votes
0 answers
26 views

I am trying to perform a Monte-Carlo simulation on quantile regression using R. Currently I am getting stuck simulating the data from the model below. ...
UNI39's user avatar
  • 11
4 votes
2 answers
109 views
+100

I’m modeling mortality using a multivariate logistic regression model with a nonlinear effect of X1 and I’m examining whether this relationship changes across ...
Konstantinos Gkirgkiris's user avatar
0 votes
1 answer
47 views

I am attempting to understand how each independent variable effects the probability of each dependent variable, which are ordinal (0, 1 and 2). Therefore, I am attempting to use ordinal logistic ...
92carmnad's user avatar
4 votes
4 answers
318 views
+50

I’m working on a logistic regression model where I want to examine whether the effect of one continuous predictor (X1) on a binary outcome depends on another ...
Konstantinos Gkirgkiris's user avatar
1 vote
1 answer
35 views

I'm running a linear mixed model, in which I have included a few categorical variables - time, sex - with two levels, as well as three continuous nutrition variables as fixed effects and their ...
Ekiboi's user avatar
  • 41
0 votes
0 answers
30 views

I am investigating the influence of policy X on grade outcomes. Earlier research was able to utilise a partial implementation of policy X in the population of interest to establish a natural ...
SharpShimmer's user avatar
1 vote
0 answers
60 views

I’m trying to get a better grasp of how to handle an issue in pre–post observational data. Let’s say I have data from a rehab center with measures at admission and discharge (only these two ...
querent's user avatar
  • 11
0 votes
0 answers
60 views

My data is a ratio of: perceived time elapsed/actual time elapsed. Now this ranges from 0 to +infinity. It a continuous positive number. My experiment is mixed model (with within and between subject ...
the_bluestreak's user avatar
5 votes
1 answer
216 views

I’m trying to understand how natural cubic splines (splines::ns) and restricted cubic splines (rms::rcs) handle knots — ...
Konstantinos Gkirgkiris's user avatar
2 votes
0 answers
41 views

I'm new to using GLMs which are not Linear Regression, and am working on a project where I am using Gamma regression with a log-link. I'm having problems with the feature engineering step. With linear ...
michael james's user avatar
3 votes
2 answers
159 views

I am trying to understand ordered factors (polynomial terms) and their interpretation in Cox Proportional Hazards regression model. I know when using lm() to fit ...
SIO's user avatar
  • 133
0 votes
0 answers
38 views

When conducting maximum likelihood estimation for simple linear regression whilst considering the regressors as random, the joint distribution of $f_{X,Y}(x,y;\theta) = f_{Y|X}(y|x;\theta) * f_{X}(x;\...
froot's user avatar
  • 83
1 vote
0 answers
49 views

community. I'm facing a modeling problem for cash flow forecasting and would like to know what the most robust mathematical/statistical approach is to solve it. The Problem: Debt Recovery Forecasting ...
sn3fru's user avatar
  • 215
1 vote
1 answer
164 views

Does the strict exogeneity assumption of OLS $ \mathbb{E} [\epsilon \mid X ] = 0 $ imply that the error terms of different observations are uncorrelated with one another, that is $ \text{Cov}( \...
robertspierre's user avatar
2 votes
2 answers
99 views

I can't seem to wrap my head around this: What is the glm() equivalent for lm(log(y) ~ x1 + x2, data=data)? Is it? a. ...
Mubita 's user avatar
  • 121
1 vote
1 answer
93 views

I would like to check for multicollinearity of the independent variables in a binary logistic regression. Some independent variables are binary (coded 0, 1), others are polytomous (converted to dummy) ...
José Luis's user avatar
0 votes
0 answers
19 views

Suppose I have a slow model with accuracy of between 75 and 80 %. I want to approximate this model with faster models. Fast models require $e$ effort and the more effort the better. I want to estimate ...
Gaslight Deceive Subvert's user avatar
2 votes
1 answer
240 views

The question titled “How are the standard errors of coefficients calculated in a regression?” is asking how the standard errors of regression coefficient estimates are computed (for example, the ...
Laut567's user avatar
  • 83
0 votes
1 answer
28 views

In my first course on linear regression, I learned the 4 basic assumptions that every textbook teaches: linearity, independence, homoscedasticity, and normality. However, I recently learned about ...
Iterator516's user avatar
5 votes
1 answer
122 views

I have two models trained and validated on the same training/validation data. Now I need to evaluate them on multiple independent test datasets (e.g., 10 different datasets of the same measure). Which ...
user26416177's user avatar
2 votes
1 answer
110 views

The equation for AIC is $$\mathrm{AIC} = n\ln(\mathrm{MSE})+2k$$ where:   $n ={}$number of observations   $\mathrm{MSE} ={}$mean squared error   $k ={}$number of parameter estimates The way I ...
doubtful_noob's user avatar
1 vote
0 answers
110 views

Consider a sample of $N/2$ pairs of individuals. Each pair belongs to a group $j$. For each individual $i$ from the $N$ sample, I measure two variables ($y_{i}$ and $x_{i}$) and the average per group $...
CafféSospeso's user avatar
0 votes
0 answers
41 views

I am analyzing the relationship between age, education, and the probability of having a high income (>50K) using data from the UCI Adult dataset. I've fit a logistic regression model with a natural ...
Konstantinos Gkirgkiris's user avatar

1
2 3 4 5
616