crunchers.statsmodels_helpers package¶

Submodules¶

crunchers.statsmodels_helpers.lazy_stats module¶

Functions for streamlining analysis.

crunchers.statsmodels_helpers.lazy_stats.build_regression_models_grid(X_hyps_dicts, ctrl_coefs_dicts, outcomes_dicts)[source]¶

crunchers.statsmodels_helpers.lazy_stats.compare_coefs(row, value, results)[source]¶

crunchers.statsmodels_helpers.lazy_stats.do_regression(data, y_var, X_ctrls=None, X_hyp=None, kind='OLS', **kwargs)[source]¶

Provide a further abstracted way to build and run multiple types of regressions.

data (pd.DataFrame): data table to use when retrieving the column headers y_var (str): column header of the outcome variable X_ctrls (str): formula specification of the “boring” variables “column_header_1 + column_header_2”… X_hyp (str): formula specification of the “interesting” variables “column_header_1 + column_header_2”… kind (str): the type of regression to run kind in [‘GLM’,’OLS’,’RLM’] == True

crunchers.statsmodels_helpers.lazy_stats.format_all_regression_models(regs, total)[source]¶

Return tuple of string formated versions of all regression tables in the regs object.

Parameters:	(reg-tree (regs) – dict-like): tree-like dict containing the regression results objects as leaves and descriptors as nodes. total (int) – total number of results tables to format.
Returns:	tuple

crunchers.statsmodels_helpers.lazy_stats.get_diff(a, b)[source]¶

crunchers.statsmodels_helpers.lazy_stats.get_log2_fold(a, b)[source]¶

crunchers.statsmodels_helpers.lazy_stats.identify_full_ctrl_names(X_vars, orig_ctrl_names)[source]¶: Return set of variable names actually used in regression, tolerating mangling of categoricals.

crunchers.statsmodels_helpers.lazy_stats.regression_grid_single(grid_item, data, kind, **kwargs)[source]¶

crunchers.statsmodels_helpers.lazy_stats.report_glm(formula, data, verbose=True, **kwargs)[source]¶: Fit GLM, print a report, and return the fit object.

crunchers.statsmodels_helpers.lazy_stats.report_logitreg(formula, data, verbose=True, disp=1)[source]¶: Fit logistic regression, print a report, and return the fit object.

crunchers.statsmodels_helpers.lazy_stats.report_ols(formula, data, fit_regularized=False, L1_wt=1, refit=False, **kwargs)[source]¶: Fit OLS regression, print a report, and return the fit object.

crunchers.statsmodels_helpers.lazy_stats.report_rlm(formula, data, verbose=True, **kwargs)[source]¶: Fit RLM, print a report, and return the fit object.

crunchers.statsmodels_helpers.lazy_stats.run_regressions_grid(grid, data, kind, max_workers=None, **kwargs)[source]¶

crunchers.statsmodels_helpers.lazy_stats.summarize_X_vars(results, sig_thresh=0.05, X_ctrls=None, X_ignore=None)[source]¶

crunchers.statsmodels_helpers.lazy_stats.summarize_grid_OLS(regs, reg_grid)[source]¶

crunchers.statsmodels_helpers.lazy_stats.summarize_grid_X_vars_OLS(regs, reg_grid, sig_thresh=0.05)[source]¶

crunchers.statsmodels_helpers.lazy_stats.summarize_multi_LOGIT(results)[source]¶: Return dataframe aggregating over-all stats from a dictionary-like object containing LOGIT result objects.

crunchers.statsmodels_helpers.lazy_stats.summarize_multi_OLS(results)[source]¶: Return dataframe aggregating over-all stats from a dictionary-like object containing OLS result objects.

crunchers.statsmodels_helpers.lazy_stats.summarize_single_OLS(regression, col_dict, name, is_regularized=False)[source]¶: Return dataframe aggregating over-all stats from a dictionary-like object containing OLS result objects.

crunchers.statsmodels_helpers.lazy_stats.tree()[source]¶

crunchers.statsmodels_helpers package¶

Submodules¶

crunchers.statsmodels_helpers.lazy_stats module¶

Module contents¶