Statsmodels OLS 'inf or NaN' error when there is none in dataset












0















My data is looks like this:



58.445355   97.668693   126.101506  192.417611  221.809156  289.785785  t0  x5
60.451638 101.626297 130.027950 191.922363 219.977219 296.777197 11 x88
61.961283 104.692036 132.872274 197.605787 225.731945 313.789832 t2 x4


The columns are num1 to num6 and then t and x



import statsmodels.api as sm
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm

formula = 'num4 ~ C(t) + C(x) + C(t):C(x)'
model = ols(formula, data).fit()

aov_table = sm.stats.anova_lm(model, typ=2)


This leads to this error:



ValueError: array must not contain infs or NaNs


Though my data frame has no Infs or NANs.



What am I doing wrong?



P.S. I have done the exact analysis successfully using aov function in R without running into any issues.



Diagnosis



data.isnull().sum()



num1            0
num2 0
num3 0
num4 0
num5 0
num6 0
t 0
x 0
dtype: int64









share|improve this question

























  • NAN stands for "not a number", what is something like t0 or x5 if not a nan?

    – enumaris
    Nov 26 '18 at 21:11













  • Sounds like your columns at the end are being cast to a numeric type on accident.

    – TheIncorrigible1
    Nov 26 '18 at 21:11











  • @enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?

    – maximusdooku
    Nov 26 '18 at 21:14











  • @TheIncorrigible1 How can I force it to retain it as character?

    – maximusdooku
    Nov 26 '18 at 21:14











  • What is the output of data.isnull().sum()

    – ALollz
    Nov 26 '18 at 21:14
















0















My data is looks like this:



58.445355   97.668693   126.101506  192.417611  221.809156  289.785785  t0  x5
60.451638 101.626297 130.027950 191.922363 219.977219 296.777197 11 x88
61.961283 104.692036 132.872274 197.605787 225.731945 313.789832 t2 x4


The columns are num1 to num6 and then t and x



import statsmodels.api as sm
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm

formula = 'num4 ~ C(t) + C(x) + C(t):C(x)'
model = ols(formula, data).fit()

aov_table = sm.stats.anova_lm(model, typ=2)


This leads to this error:



ValueError: array must not contain infs or NaNs


Though my data frame has no Infs or NANs.



What am I doing wrong?



P.S. I have done the exact analysis successfully using aov function in R without running into any issues.



Diagnosis



data.isnull().sum()



num1            0
num2 0
num3 0
num4 0
num5 0
num6 0
t 0
x 0
dtype: int64









share|improve this question

























  • NAN stands for "not a number", what is something like t0 or x5 if not a nan?

    – enumaris
    Nov 26 '18 at 21:11













  • Sounds like your columns at the end are being cast to a numeric type on accident.

    – TheIncorrigible1
    Nov 26 '18 at 21:11











  • @enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?

    – maximusdooku
    Nov 26 '18 at 21:14











  • @TheIncorrigible1 How can I force it to retain it as character?

    – maximusdooku
    Nov 26 '18 at 21:14











  • What is the output of data.isnull().sum()

    – ALollz
    Nov 26 '18 at 21:14














0












0








0








My data is looks like this:



58.445355   97.668693   126.101506  192.417611  221.809156  289.785785  t0  x5
60.451638 101.626297 130.027950 191.922363 219.977219 296.777197 11 x88
61.961283 104.692036 132.872274 197.605787 225.731945 313.789832 t2 x4


The columns are num1 to num6 and then t and x



import statsmodels.api as sm
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm

formula = 'num4 ~ C(t) + C(x) + C(t):C(x)'
model = ols(formula, data).fit()

aov_table = sm.stats.anova_lm(model, typ=2)


This leads to this error:



ValueError: array must not contain infs or NaNs


Though my data frame has no Infs or NANs.



What am I doing wrong?



P.S. I have done the exact analysis successfully using aov function in R without running into any issues.



Diagnosis



data.isnull().sum()



num1            0
num2 0
num3 0
num4 0
num5 0
num6 0
t 0
x 0
dtype: int64









share|improve this question
















My data is looks like this:



58.445355   97.668693   126.101506  192.417611  221.809156  289.785785  t0  x5
60.451638 101.626297 130.027950 191.922363 219.977219 296.777197 11 x88
61.961283 104.692036 132.872274 197.605787 225.731945 313.789832 t2 x4


The columns are num1 to num6 and then t and x



import statsmodels.api as sm
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm

formula = 'num4 ~ C(t) + C(x) + C(t):C(x)'
model = ols(formula, data).fit()

aov_table = sm.stats.anova_lm(model, typ=2)


This leads to this error:



ValueError: array must not contain infs or NaNs


Though my data frame has no Infs or NANs.



What am I doing wrong?



P.S. I have done the exact analysis successfully using aov function in R without running into any issues.



Diagnosis



data.isnull().sum()



num1            0
num2 0
num3 0
num4 0
num5 0
num6 0
t 0
x 0
dtype: int64






python pandas statistics statsmodels anova






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 27 '18 at 18:52









jtweeder

33418




33418










asked Nov 26 '18 at 21:09









maximusdookumaximusdooku

1,48221445




1,48221445













  • NAN stands for "not a number", what is something like t0 or x5 if not a nan?

    – enumaris
    Nov 26 '18 at 21:11













  • Sounds like your columns at the end are being cast to a numeric type on accident.

    – TheIncorrigible1
    Nov 26 '18 at 21:11











  • @enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?

    – maximusdooku
    Nov 26 '18 at 21:14











  • @TheIncorrigible1 How can I force it to retain it as character?

    – maximusdooku
    Nov 26 '18 at 21:14











  • What is the output of data.isnull().sum()

    – ALollz
    Nov 26 '18 at 21:14



















  • NAN stands for "not a number", what is something like t0 or x5 if not a nan?

    – enumaris
    Nov 26 '18 at 21:11













  • Sounds like your columns at the end are being cast to a numeric type on accident.

    – TheIncorrigible1
    Nov 26 '18 at 21:11











  • @enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?

    – maximusdooku
    Nov 26 '18 at 21:14











  • @TheIncorrigible1 How can I force it to retain it as character?

    – maximusdooku
    Nov 26 '18 at 21:14











  • What is the output of data.isnull().sum()

    – ALollz
    Nov 26 '18 at 21:14

















NAN stands for "not a number", what is something like t0 or x5 if not a nan?

– enumaris
Nov 26 '18 at 21:11







NAN stands for "not a number", what is something like t0 or x5 if not a nan?

– enumaris
Nov 26 '18 at 21:11















Sounds like your columns at the end are being cast to a numeric type on accident.

– TheIncorrigible1
Nov 26 '18 at 21:11





Sounds like your columns at the end are being cast to a numeric type on accident.

– TheIncorrigible1
Nov 26 '18 at 21:11













@enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?

– maximusdooku
Nov 26 '18 at 21:14





@enumaris True. But I am doing an ANOVA analysis where certain variables will not be numbers. Any suggestions?

– maximusdooku
Nov 26 '18 at 21:14













@TheIncorrigible1 How can I force it to retain it as character?

– maximusdooku
Nov 26 '18 at 21:14





@TheIncorrigible1 How can I force it to retain it as character?

– maximusdooku
Nov 26 '18 at 21:14













What is the output of data.isnull().sum()

– ALollz
Nov 26 '18 at 21:14





What is the output of data.isnull().sum()

– ALollz
Nov 26 '18 at 21:14












0






active

oldest

votes











Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53489106%2fstatsmodels-ols-inf-or-nan-error-when-there-is-none-in-dataset%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes
















draft saved

draft discarded




















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53489106%2fstatsmodels-ols-inf-or-nan-error-when-there-is-none-in-dataset%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

A CLEAN and SIMPLE way to add appendices to Table of Contents and bookmarks

Calculate evaluation metrics using cross_val_predict sklearn

Insert data from modal to MySQL (multiple modal on website)