您现在的位置是：首页 > 其它

当前栏目

[ML] 2. Introduction to neural networks

to ML Networks Neural Introduction

2023-09-14 08:59:14 时间

Training an algorithm involes four ingredients:

Data
Model
Objective function: We put data input a Model and get output out of it. The value we call it as 'lost'. We want to minimize the 'lost' value.
Optimization algorithm: For example the linear model, we will try to optimize y = wx + b, 'w' & 'b' so that it will minimize the 'lost' value.

Repeat the process...

Three types of machine learning:

Supervised: Give feedback

Classification: outputs are categories: cats or dogs
Regression: output would be numbers.

Unsupervised: No feedback, find parttens

Reinforcement: Train the algorithm to works in a enviorment based on the rewords it receives. (Just like training your dog)

Linear Model:

f(x) = x * w + b

x: input

w: coefficient / weight

b: intercept / bias

Linear Model: Multi inputs:

x, w are both vectors:

x: 1 * 2

w: 2 * 1

f(x): 1 * 1

Notice that the lienar model doesn't chage, it is still:

f(x) = x * w + b

Lienar Model: multi inputs and multi outputs:

For 'W', the first index is always the same as X; the second index is always the same as ouput Y.

If there is K inputs and M outputs, the number of Weigths would be K * M

The number of bias is equal to the number of ouputs: M.

N * M = (N * K) * (K * M) + 1 * M

Each model is determined by its weights and biases.

Objection function:

Is the measure used to evaluate how well the model's output match the desired correct values.

Loss function: the lower the loss function, the higher the level of accuracy (Supervized learning)
Reward function: the hight of the reward function, the higher the level of accuracy (Reubfircement learning)

Loss functions for Supervised learning:

Regression: L2-NORM

Classification: CROSS-ENTROPY

Expect cross-entropy should be lower.

Optimization algorithm: Dradient descent

Until one point, the following value never update anymore.

The picture looks like this:

Generally, we want the learning rate to be:

　　High enough, so we can reach the closest minimum in a rational amount of time

　　Low enough, so we don't oscillate around the minimum

N-parameter gradient descent

猜你喜欢

VS Code 必备插件推荐「建议收藏」
【重识云原生】第六章容器6.1.7.1节——Docker核心技术cgroups综述
Oracle 17081数据库性能升级之路（oracle-17081）
Mysql存储过程学习笔记--建立简单的存储过程
【说站】python TestCase测试用例怎么用
asp.net中将表单提交到另一页Code-Behind（代码和html在不同的页面）
PowerQuery制作工资条或成绩条
深入剖析SQL Server错误码（sqlserver错误码）
2023IntelliJ IDEA激活码(2023IntelliJ IDEA最新激活码)2023IntelliJ IDEA激活码
密码登录禁用密码，安全使用Redis集群（redis集群不使用）
asp.net下检测SQL注入式攻击代码
提升媒体体验：Linux机顶盒升级行动（linux机顶盒升级）
【说站】java中Process是什么
为Typecho博客添加加载时间和网站运行时间
emWin专题——emWin简介及模拟器的使用「建议收藏」
javascript学习笔记（七）利用javascript来创建和存储cookie
asp实现新评论自动发短信提示的代码
Python3.x：定时任务实现方式详解编程语言

相关主题

linq to xml
ORM TO SQL
html to pdf
LINQ to SQL
Roman to Integer

zl程序教程