Pywayne Statistics

Comprehensive statistical testing library with 37+ methods for normality tests, location tests, correlation tests, time series tests, and model diagnostics....

MIT-0 · Free to use, modify, and redistribute. No attribution required.

⭐ 0 · 480 · 0 current installs · 0 all-time installs

by@wangyendt

MIT-0

Security Scan

VirusTotal

Benign

View report →

OpenClaw

Benign

medium confidence

✓

Purpose & Capability

The name/description match the contents of SKILL.md: it documents many statistical tests and shows example usage. The declared capability aligns with the methods and examples shown.

ℹ

Instruction Scope

The instructions are examples that import 'pywayne.statistics' and call API methods, but the skill provides no install steps or code files. The SKILL.md does not direct the agent to read files, environment variables, or send data to external endpoints — so there is no scope creep. However, runtime use will require the referenced Python package to actually exist in the environment; the skill does not provide provenance or installation guidance.

✓

Install Mechanism

No install spec is provided (instruction-only), which minimizes installation risk. Because nothing is downloaded or written, there is no install-time code execution risk from the skill itself.

✓

Credentials

No environment variables, secrets, or config paths are requested. The claimed functionality (statistical tests) does not require additional credentials, so the lack of requested secrets is appropriate.

✓

Persistence & Privilege

The skill does not request 'always' presence and does not instruct modifying system or other skills' configurations. Default autonomous invocation is allowed but not accompanied by increased privileges.

Assessment

This SKILL.md is documentation-only for a Python package; it does not include the package source or installation steps. Before relying on it, verify that the 'pywayne' package (or equivalent) is available from a trusted source (PyPI/GitHub) and inspect its source code or provenance. If you plan to execute the example code, pip-install only from known repositories and run in an isolated environment. There are no direct signs of data-exfiltration or credential requests in the skill itself, but lack of provenance lowers confidence — confirm the package origin before installing or running it.

Like a lobster shell, security has layers — review code before you run it.

Current versionv0.1.0

Download zip

latestvk97eq7yyrqtwpwg61t8xbgmxpx818ew3

License

MIT-0

Free to use, modify, and redistribute. No attribution required.

Termshttps://spdx.org/licenses/MIT-0.html

SKILL.md

Pywayne Statistics

Comprehensive statistical testing library for hypothesis testing, A/B testing, and data analysis.

Quick Start

from pywayne.statistics import NormalityTests, LocationTests
import numpy as np

# Test data normality
nt = NormalityTests()
data = np.random.normal(0, 1, 100)
result = nt.shapiro_wilk(data)
print(f"p-value: {result.p_value:.4f}, is_normal: {not result.reject_null}")

# Compare two groups
lt = LocationTests()
group_a = np.random.normal(100, 15, 50)
group_b = np.random.normal(105, 15, 50)
result = lt.two_sample_ttest(group_a, group_b)
print(f"Significant difference: {result.reject_null}")

Test Categories

NormalityTests (`NormalityTests`)

Test if data follows a normal distribution or other specified distributions.

Method	Description	Use Case
`shapiro_wilk`	Shapiro-Wilk test	Small-medium samples (n ≤ 5000)
`ks_test_normal`	K-S normality test	Medium-large samples
`ks_test_two_sample`	Two-sample K-S test	Compare two sample distributions
`anderson_darling`	Anderson-Darling test	Tail-sensitive normality test
`dagostino_pearson`	D'Agostino-Pearson K²	Based on skewness and kurtosis
`jarque_bera`	Jarque-Bera test	Large samples, regression residuals
`chi_square_goodness_of_fit`	Chi-square goodness-of-fit	Categorical data
`lilliefors_test`	Lilliefors test	Unknown parameters K-S test

Example:

from pywayne.statistics import NormalityTests

nt = NormalityTests()
result = nt.shapiro_wilk(data)
if result.p_value < 0.05:
    print("Data is NOT normally distributed")
else:
    print("Data follows normal distribution")

LocationTests (`LocationTests`)

Compare means or medians across groups (parametric and non-parametric).

Method	Description	Use Case
`one_sample_ttest`	One-sample t-test	Compare sample mean to a value
`two_sample_ttest`	Two-sample t-test	Compare two independent group means
`paired_ttest`	Paired t-test	Compare before/after measurements
`one_way_anova`	One-way ANOVA	Compare 3+ group means
`mann_whitney_u`	Mann-Whitney U test	Non-parametric two-sample test
`wilcoxon_signed_rank`	Wilcoxon signed-rank	Non-parametric paired test
`kruskal_wallis`	Kruskal-Wallis H test	Non-parametric multi-group test

Example (A/B Testing):

from pywayne.statistics import LocationTests, NormalityTests

lt = LocationTests()
nt = NormalityTests()

# Check normality first
if nt.shapiro_wilk(control).p_value > 0.05:
    result = lt.two_sample_ttest(control, treatment)
else:
    result = lt.mann_whitney_u(control, treatment)

print(f"Effect significant: {result.reject_null}")

CorrelationTests (`CorrelationTests`)

Test correlation between variables and independence of categorical variables.

Method	Description	Use Case
`pearson_correlation`	Pearson correlation	Linear relationship
`spearman_correlation`	Spearman's rank	Monotonic relationship
`kendall_tau`	Kendall's tau	Rank correlation, small samples
`chi_square_independence`	Chi-square independence	Categorical variables
`fisher_exact_test`	Fisher's exact test	2×2 contingency table
`mcnemar_test`	McNemar's test	Paired categorical data

Example:

from pywayne.statistics import CorrelationTests

ct = CorrelationTests()
result = ct.pearson_correlation(x, y)
print(f"Correlation: {result.statistic:.3f}, p-value: {result.p_value:.4f}")

TimeSeriesTests (`TimeSeriesTests`)

Test time series properties: stationarity, autocorrelation, cointegration.

Method	Description	Use Case
`adf_test`	Augmented Dickey-Fuller	Unit root test for stationarity
`kpss_test`	KPSS test	Stationarity test (complements ADF)
`ljung_box_test`	Ljung-Box Q test	Overall autocorrelation
`runs_test`	Runs test	Randomness testing
`arch_test`	ARCH effect test	Heteroscedasticity
`granger_causality`	Granger causality	Causal relationship
`engle_granger_cointegration`	Engle-Granger cointegration	Long-term equilibrium
`breusch_godfrey_test`	Breusch-Godfrey	Higher-order autocorrelation

Example:

from pywayne.statistics import TimeSeriesTests

tst = TimeSeriesTests()
adf_result = tst.adf_test(time_series_data)
kpss_result = tst.kpss_test(time_series_data)

if adf_result.reject_null:
    print("Series is stationary")
else:
    print("Series has unit root (non-stationary)")

ModelDiagnostics (`ModelDiagnostics`)

Regression model diagnostics: heteroscedasticity, autocorrelation, multicollinearity.

Method	Description	Use Case
`breusch_pagan_test`	Breusch-Pagan	Heteroscedasticity test
`white_test`	White's test	General heteroscedasticity
`goldfeld_quandt_test`	Goldfeld-Quandt	Structural break heteroscedasticity
`durbin_watson_test`	Durbin-Watson	First-order autocorrelation
`variance_inflation_factor`	VIF	Multicollinearity diagnosis
`levene_test`	Levene's test	Homogeneity of variance
`bartlett_test`	Bartlett's test	Homogeneity (normal assumption)
`residual_normality_test`	Residual normality	Regression assumption check

Example:

from pywayne.statistics import ModelDiagnostics

md = ModelDiagnostics()
residuals = y - model.predict(X)

# Check assumptions
bp_result = md.breusch_pagan_test(residuals, X)
dw_result = md.durbin_watson_test(residuals)

if bp_result.reject_null:
    print("Warning: Heteroscedasticity detected")

TestResult Object

All test methods return a unified TestResult object:

result = nt.shapiro_wilk(data)

# Access results
result.test_name        # Test method name
result.statistic        # Test statistic value
result.p_value          # P-value
result.reject_null      # True if null hypothesis is rejected
result.critical_value   # Critical value (if applicable)
result.confidence_interval # Tuple (lower, upper) if applicable
result.effect_size      # Effect size if applicable
result.additional_info  # Dict with additional information

Utility Functions

`list_all_tests()`

List all available test methods across all modules.

from pywayne.statistics import list_all_tests
print(list_all_tests())

`show_test_usage(method_name)`

Display usage and documentation for a specific test.

from pywayne.statistics import show_test_usage
show_test_usage('shapiro_wilk')

Method Selection Guide

Normality Tests

Sample Size	Recommended Method
n < 30	Shapiro-Wilk
30 ≤ n ≤ 300	Shapiro-Wilk, D'Agostino-Pearson
n > 300	Jarque-Bera, Kolmogorov-Smirnov

Location Tests

Condition	Parametric	Non-parametric
Normal data	t-test, ANOVA	-
Non-normal data	-	Mann-Whitney U, Kruskal-Wallis
Paired data	Paired t-test	Wilcoxon signed-rank

Multiple Testing Correction

When performing multiple tests, apply p-value correction:

from statsmodels.stats.multitest import multipletests

p_values = [r.p_value for r in results]
rejected, p_corrected, _, _ = multipletests(
    p_values, alpha=0.05, method='fdr_bh'
)

Common Applications

Data Quality Check

def data_quality_check(data):
    nt = NormalityTests()
    lt = LocationTests()

    normality = nt.shapiro_wilk(data)

    # Outlier detection (IQR)
    Q1, Q3 = np.percentile(data, [25, 75])
    IQR = Q3 - Q1
    outliers = data[(data < Q1 - 1.5*IQR) | (data > Q3 + 1.5*IQR)]

    return {
        'size': len(data),
        'is_normal': not normality.reject_null,
        'p_value': normality.p_value,
        'outliers': len(outliers)
    }

A/B Testing Workflow

def ab_test_analysis(control, treatment):
    nt = NormalityTests()
    lt = LocationTests()

    # Check normality
    norm_c = nt.shapiro_wilk(control[:100])
    norm_t = nt.shapiro_wilk(treatment[:100])

    # Choose appropriate test
    if norm_c.p_value > 0.05 and norm_t.p_value > 0.05:
        result = lt.two_sample_ttest(control, treatment)
    else:
        result = lt.mann_whitney_u(control, treatment)

    return {
        'test_used': result.test_name,
        'p_value': result.p_value,
        'significant': result.reject_null,
        'effect_size': result.effect_size
    }

Regression Model Diagnostics

def diagnose_model(y, X, model):
    md = ModelDiagnostics()
    residuals = y - model.predict(X)

    return {
        'heteroscedasticity_bp': md.breusch_pagan_test(residuals, X).reject_null,
        'autocorrelation_dw': md.durbin_watson_test(residuals).statistic,
        'residuals_normal': md.residual_normality_test(residuals).p_value,
        'vif_max': max(md.variance_inflation_factor(X))
    }

Notes

All methods accept np.ndarray or list as input
All methods return TestResult with consistent interface
Always validate test assumptions before applying parametric tests
Apply multiple testing correction when performing several tests
Report effect sizes alongside p-values for complete interpretation

Files

1 total

Select a file

Select a file to preview.

Comments

Loading comments…

Pywayne Statistics

License

SKILL.md

Pywayne Statistics

Quick Start

Test Categories

NormalityTests (NormalityTests)

LocationTests (LocationTests)

CorrelationTests (CorrelationTests)

TimeSeriesTests (TimeSeriesTests)

ModelDiagnostics (ModelDiagnostics)

TestResult Object

Utility Functions

list_all_tests()

show_test_usage(method_name)

Method Selection Guide

Normality Tests

Location Tests

Multiple Testing Correction

Common Applications

Data Quality Check

A/B Testing Workflow

Regression Model Diagnostics

Notes

Files

Comments

NormalityTests (`NormalityTests`)

LocationTests (`LocationTests`)

CorrelationTests (`CorrelationTests`)

TimeSeriesTests (`TimeSeriesTests`)

ModelDiagnostics (`ModelDiagnostics`)

`list_all_tests()`

`show_test_usage(method_name)`