您现在的位置是：首页 > 工具

当前栏目

第五章线性回归学习笔记中

笔记学习回归线性第五章

2023-09-27 14:25:50 时间

5-5 衡量线性回归法的指标 MSE,RMS,MAE05-Regression-Metrics-MSE-vs-MAE

封装我们自己的评测函数

5-6 最好的衡量线性回归法的指标 R Squared

5-5 衡量线性回归法的指标 MSE,RMS,MAE05-Regression-Metrics-MSE-vs-MAE

其实是对训练数据集来说

衡量标准与样本数无关

量纲是平方，有时候会比较麻烦所以

x = x[y < 50.0]
y = y[y < 50.0]

没想明白？ x，y怎么都是判断y<50

SimpleLinearRegression.py

import numpy as np


class SimpleLinearRegression:

    def __init__(self):
        """初始化Simple Linear Regression模型"""
        self.a_ = None
        self.b_ = None

    def fit(self, x_train, y_train):
        """根据训练数据集x_train训练Simple Linear Regression模型"""
        assert x_train.ndim == 1, \
            "Simple Linear Regressor can only solve single feature training data."
        assert len(x_train) == len(y_train), \
            "the size of x_train must be equal to the size of y_train"

        x_mean = np.mean(x_train)
        y_mean = np.mean(y_train)

        self.a_ = (x_train - x_mean).dot(y_train - y_mean) / (x_train - x_mean).dot(x_train - x_mean)
        self.b_ = y_mean - self.a_ * x_mean

        return self

    def predict(self, x_predict):
        """给定待预测数据集x_predict，返回表示x_predict的结果向量"""
        assert x_predict.ndim == 1, \
            "Simple Linear Regressor can only solve single feature training data."
        assert self.a_ is not None and self.b_ is not None, \
            "must fit before predict!"

        return np.array([self._predict(x) for x in x_predict])

    def _predict(self, x_single):
        """给定单个待预测数据x，返回x的预测结果值"""
        return self.a_ * x_single + self.b_

    def __repr__(self):
        return "SimpleLinearRegression()"

封装我们自己的评测函数

metrics.py

import numpy as np
from math import sqrt


def accuracy_score(y_true, y_predict):
    """计算y_true和y_predict之间的准确率"""
    assert len(y_true) == len(y_predict), \
        "the size of y_true must be equal to the size of y_predict"

    return np.sum(y_true == y_predict) / len(y_true)


def mean_squared_error(y_true, y_predict):
    """计算y_true和y_predict之间的MSE"""
    assert len(y_true) == len(y_predict), \
        "the size of y_true must be equal to the size of y_predict"

    return np.sum((y_true - y_predict)**2) / len(y_true)


def root_mean_squared_error(y_true, y_predict):
    """计算y_true和y_predict之间的RMSE"""

    return sqrt(mean_squared_error(y_true, y_predict))


def mean_absolute_error(y_true, y_predict):
    """计算y_true和y_predict之间的MAE"""

    return np.sum(np.absolute(y_true - y_predict)) / len(y_true)

RMSE有放大错误值的趋势，而MAE没有， RMSE尽量小则其最大错误值比较小，其本质是在减小最大的识差的那个值

5-6 最好的衡量线性回归法的指标 R Squared

RMSE和MAE没有这样的体现

预测一真值，平均值—y

意义是什么？为什么好？

假设数据间有一定的线性关系

两个脚本在同一个目录或文件夹下，则一个引用另一个可以用 aaa.bbb 省略aaa则 .bbb

SimpleLinearRegression.py

import numpy as np
from .metrics import r2_score


class SimpleLinearRegression:

    def __init__(self):
        """初始化Simple Linear Regression模型"""
        self.a_ = None
        self.b_ = None

    def fit(self, x_train, y_train):
        """根据训练数据集x_train训练Simple Linear Regression模型"""
        assert x_train.ndim == 1, \
            "Simple Linear Regressor can only solve single feature training data."
        assert len(x_train) == len(y_train), \
            "the size of x_train must be equal to the size of y_train"

        x_mean = np.mean(x_train)
        y_mean = np.mean(y_train)

        self.a_ = (x_train - x_mean).dot(y_train - y_mean) / (x_train - x_mean).dot(x_train - x_mean)
        self.b_ = y_mean - self.a_ * x_mean

        return self

    def predict(self, x_predict):
        """给定待预测数据集x_predict，返回表示x_predict的结果向量"""
        assert x_predict.ndim == 1, \
            "Simple Linear Regressor can only solve single feature training data."
        assert self.a_ is not None and self.b_ is not None, \
            "must fit before predict!"

        return np.array([self._predict(x) for x in x_predict])

    def _predict(self, x_single):
        """给定单个待预测数据x，返回x的预测结果值"""
        return self.a_ * x_single + self.b_

    def score(self, x_test, y_test):
        """根据测试数据集 x_test 和 y_test 确定当前模型的准确度"""

        y_predict = self.predict(x_test)
        return r2_score(y_test, y_predict)

    def __repr__(self):
        return "SimpleLinearRegression()"

metrics.py

import numpy as np
from math import sqrt


def accuracy_score(y_true, y_predict):
    """计算y_true和y_predict之间的准确率"""
    assert len(y_true) == len(y_predict), \
        "the size of y_true must be equal to the size of y_predict"

    return np.sum(y_true == y_predict) / len(y_true)


def mean_squared_error(y_true, y_predict):
    """计算y_true和y_predict之间的MSE"""
    assert len(y_true) == len(y_predict), \
        "the size of y_true must be equal to the size of y_predict"

    return np.sum((y_true - y_predict)**2) / len(y_true)


def root_mean_squared_error(y_true, y_predict):
    """计算y_true和y_predict之间的RMSE"""

    return sqrt(mean_squared_error(y_true, y_predict))


def mean_absolute_error(y_true, y_predict):
    """计算y_true和y_predict之间的MAE"""
    assert len(y_true) == len(y_predict), \
        "the size of y_true must be equal to the size of y_predict"

    return np.sum(np.absolute(y_true - y_predict)) / len(y_true)


def r2_score(y_true, y_predict):
    """计算y_true和y_predict之间的R Square"""

    return 1 - mean_squared_error(y_true, y_predict)/np.var(y_true)

scikit-learn中的LinearRegression中的score返回r2_score

https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html

猜你喜欢

【故障处理】队列等待之enq IV - contention案例
异常检测：TODS工具库（与PyOD类似）【包含多种时间序列上的异常检测算法】
Confluence 6 从一个 XML 备份中导入一个空间
github下载加速
Hibernate 查询语言
产品推介 | 分布式云物理计算服务-助力企业把握5G机遇
r个有标志的球放进n个不同的盒子里，要求无一空盒，问有多少种不同的分配方案？
jmeter -- beanshell 执行本地py文件
139.前后端分离与前后端不分离区别、实际应用中该如何选择？
美国芯片厂商收购Novati 计划短期开发高度差异化的产品
URL刷新工具(绿色版)　>　也许对你有用哦~~！
poj1050 To the Max (动态规划)
3、17笔记
瞎折腾之Webhooks
关于数组乱序的深挖——“感觉一直在写毒代码”
memcached的分布式算法-Consistent Hashing
Android自定义控件：类QQ未读消息拖拽效果
你必须了解的基础的Linux网络命令

相关主题

IO流学习笔记
H.264学习笔记
php学习笔记1
Jenkins | 笔记
SpringMVC学习笔记----
笔记笔记笔记
Struts2笔记

zl程序教程

当前栏目

第五章线性回归学习笔记中

5-5 衡量线性回归法的指标 MSE,RMS,MAE05-Regression-Metrics-MSE-vs-MAE

封装我们自己的评测函数

5-6 最好的衡量线性回归法的指标 R Squared

相关文章

当前栏目

第五章 线性回归 学习笔记中

5-5 衡量线性回归法的指标 MSE,RMS,MAE05-Regression-Metrics-MSE-vs-MAE

封装我们自己的评测函数

5-6 最好的衡量线性回归法的指标 R Squared

相关文章

第五章线性回归学习笔记中