Statsmodels OLS 'inf or NaN' error when there is none in dataset
My data is looks like this:
58.445355 97.668693 126.101506 192.417611 221.809156 289.785785 t0 x5
60.451638 101.626297 130.027950 191.922363 219.977219 296.777197 11 x88
61.961283 104.692036 132.872274 197.605787 225.731945 313.789832 t2 x4
The columns are num1
to num6
and then t
and x
import statsmodels.api as sm
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm
formula = 'num4 ~ C(t) + C(x) + C(t):C(x)'
model = ols(formula, data).fit()
aov_table = sm.stats.anova_lm(model, typ=2)
This leads to this error:
ValueError: array must not contain infs or NaNs
Though my data frame has no Infs or NANs.
What am I doing wrong?
P.S. I have done the exact analysis successfully using aov
function in R without running into any issues.
Diagnosis
data.isnull().sum()
num1 0
num2 0
num3 0
num4 0
num5 0
num6 0
t 0
x 0
dtype: int64
python pandas statistics statsmodels anova
|
show 6 more comments
My data is looks like this:
58.445355 97.668693 126.101506 192.417611 221.809156 289.785785 t0 x5
60.451638 101.626297 130.027950 191.922363 219.977219 296.777197 11 x88
61.961283 104.692036 132.872274 197.605787 225.731945 313.789832 t2 x4
The columns are num1
to num6
and then t
and x
import statsmodels.api as sm
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm
formula = 'num4 ~ C(t) + C(x) + C(t):C(x)'
model = ols(formula, data).fit()
aov_table = sm.stats.anova_lm(model, typ=2)
This leads to this error:
ValueError: array must not contain infs or NaNs
Though my data frame has no Infs or NANs.
What am I doing wrong?
P.S. I have done the exact analysis successfully using aov
function in R without running into any issues.
Diagnosis
data.isnull().sum()
num1 0
num2 0
num3 0
num4 0
num5 0
num6 0
t 0
x 0
dtype: int64
python pandas statistics statsmodels anova
NAN stands for "not a number", what is something liket0
orx5
if not a nan?
– enumaris
Nov 26 '18 at 21:11
Sounds like your columns at the end are being cast to a numeric type on accident.
– TheIncorrigible1
Nov 26 '18 at 21:11
@enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?
– maximusdooku
Nov 26 '18 at 21:14
@TheIncorrigible1 How can I force it to retain it as character?
– maximusdooku
Nov 26 '18 at 21:14
What is the output ofdata.isnull().sum()
– ALollz
Nov 26 '18 at 21:14
|
show 6 more comments
My data is looks like this:
58.445355 97.668693 126.101506 192.417611 221.809156 289.785785 t0 x5
60.451638 101.626297 130.027950 191.922363 219.977219 296.777197 11 x88
61.961283 104.692036 132.872274 197.605787 225.731945 313.789832 t2 x4
The columns are num1
to num6
and then t
and x
import statsmodels.api as sm
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm
formula = 'num4 ~ C(t) + C(x) + C(t):C(x)'
model = ols(formula, data).fit()
aov_table = sm.stats.anova_lm(model, typ=2)
This leads to this error:
ValueError: array must not contain infs or NaNs
Though my data frame has no Infs or NANs.
What am I doing wrong?
P.S. I have done the exact analysis successfully using aov
function in R without running into any issues.
Diagnosis
data.isnull().sum()
num1 0
num2 0
num3 0
num4 0
num5 0
num6 0
t 0
x 0
dtype: int64
python pandas statistics statsmodels anova
My data is looks like this:
58.445355 97.668693 126.101506 192.417611 221.809156 289.785785 t0 x5
60.451638 101.626297 130.027950 191.922363 219.977219 296.777197 11 x88
61.961283 104.692036 132.872274 197.605787 225.731945 313.789832 t2 x4
The columns are num1
to num6
and then t
and x
import statsmodels.api as sm
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm
formula = 'num4 ~ C(t) + C(x) + C(t):C(x)'
model = ols(formula, data).fit()
aov_table = sm.stats.anova_lm(model, typ=2)
This leads to this error:
ValueError: array must not contain infs or NaNs
Though my data frame has no Infs or NANs.
What am I doing wrong?
P.S. I have done the exact analysis successfully using aov
function in R without running into any issues.
Diagnosis
data.isnull().sum()
num1 0
num2 0
num3 0
num4 0
num5 0
num6 0
t 0
x 0
dtype: int64
python pandas statistics statsmodels anova
python pandas statistics statsmodels anova
edited Nov 27 '18 at 18:52
jtweeder
33418
33418
asked Nov 26 '18 at 21:09
maximusdookumaximusdooku
1,48221445
1,48221445
NAN stands for "not a number", what is something liket0
orx5
if not a nan?
– enumaris
Nov 26 '18 at 21:11
Sounds like your columns at the end are being cast to a numeric type on accident.
– TheIncorrigible1
Nov 26 '18 at 21:11
@enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?
– maximusdooku
Nov 26 '18 at 21:14
@TheIncorrigible1 How can I force it to retain it as character?
– maximusdooku
Nov 26 '18 at 21:14
What is the output ofdata.isnull().sum()
– ALollz
Nov 26 '18 at 21:14
|
show 6 more comments
NAN stands for "not a number", what is something liket0
orx5
if not a nan?
– enumaris
Nov 26 '18 at 21:11
Sounds like your columns at the end are being cast to a numeric type on accident.
– TheIncorrigible1
Nov 26 '18 at 21:11
@enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?
– maximusdooku
Nov 26 '18 at 21:14
@TheIncorrigible1 How can I force it to retain it as character?
– maximusdooku
Nov 26 '18 at 21:14
What is the output ofdata.isnull().sum()
– ALollz
Nov 26 '18 at 21:14
NAN stands for "not a number", what is something like
t0
or x5
if not a nan?– enumaris
Nov 26 '18 at 21:11
NAN stands for "not a number", what is something like
t0
or x5
if not a nan?– enumaris
Nov 26 '18 at 21:11
Sounds like your columns at the end are being cast to a numeric type on accident.
– TheIncorrigible1
Nov 26 '18 at 21:11
Sounds like your columns at the end are being cast to a numeric type on accident.
– TheIncorrigible1
Nov 26 '18 at 21:11
@enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?
– maximusdooku
Nov 26 '18 at 21:14
@enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?
– maximusdooku
Nov 26 '18 at 21:14
@TheIncorrigible1 How can I force it to retain it as character?
– maximusdooku
Nov 26 '18 at 21:14
@TheIncorrigible1 How can I force it to retain it as character?
– maximusdooku
Nov 26 '18 at 21:14
What is the output of
data.isnull().sum()
– ALollz
Nov 26 '18 at 21:14
What is the output of
data.isnull().sum()
– ALollz
Nov 26 '18 at 21:14
|
show 6 more comments
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53489106%2fstatsmodels-ols-inf-or-nan-error-when-there-is-none-in-dataset%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53489106%2fstatsmodels-ols-inf-or-nan-error-when-there-is-none-in-dataset%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
NAN stands for "not a number", what is something like
t0
orx5
if not a nan?– enumaris
Nov 26 '18 at 21:11
Sounds like your columns at the end are being cast to a numeric type on accident.
– TheIncorrigible1
Nov 26 '18 at 21:11
@enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?
– maximusdooku
Nov 26 '18 at 21:14
@TheIncorrigible1 How can I force it to retain it as character?
– maximusdooku
Nov 26 '18 at 21:14
What is the output of
data.isnull().sum()
– ALollz
Nov 26 '18 at 21:14