Temple

Temple

Keep Steady & Just Do it

Book Review - Hands on Machine Learning(C2-2)

1 분 소요

All of materials(quotes, images, definitions) are from this book.
It’s all just for self-study.

Look at the big picture

Frame the Problem

First question to ask : what exactly is the business objective
How does the company expect to use and benfit from this model?

pipeline : a sequence of data processing components

Second question to ask : what the current solution looks like (if any)
will often give a reference performance, as well as insights on how to solve the problem
- typical supervised learning task(with labeld data)
- typical regression task(predict a value)
- multiple regression problem : use multiple features to make a prediction
- also a univariate regression problem : predict a single value for each district
- plain batch learning

Select a Performance Measure

RMSE(Root Mean Square Error) : corresponds to the Euclidean norm
MAE(Mean Absolute Error) : corresponds to the Manhattan norm
- The higher the norm index, the more it focuses on large values and neglects samll ones
  - This is why RMSE is more sensitive to outliers than the MAE
  - But when outliers are exponentially rare(like in a bell-shaped curve), the RMSE performs very well and is generally preferred

Check the Assumptions

We need actual values of each district for our prediction(not category e.g., “cheap”,”medium”,”expensive)

Get the Data & Create the Workspace

I’m gonna use google colab instead using jupyter notebook

Download the Data

From this part, I’ll upload this notebook on my github

This is my github repository

공유하기

Twitter Facebook LinkedIn

댓글남기기

참고

Netflix Movies and TV Shows (3) Netflix timeline

2 분 소요

앞으로 4개의 캐글 노트북에 대한 글을 정리하려 합니다. 첫번째 캐글은 Netflix Movies and TV Shows 라는 데이터셋입니다. 개인적인 논리적 흐름을 고민하는 데 있어 Netflix Research에서 Analytics와 관련된 아티클을 공부했습니다. 필요하...

Netflix Movies and TV Shows (1) About Netflix

2 분 소요

앞으로 4개의 캐글 노트북에 대한 글을 정리하려 합니다. 첫번째 캐글은 Netflix Movies and TV Shows 라는 데이터셋입니다. 개인적인 논리적 흐름을 고민하는 데 있어 Netflix Research에서 Analytics와 관련된 아티클을 공부했습니다. 필요하...

2022 KAKAO BLIND RECRUITMENT 주차 요금 계산 파이썬

1 분 소요

2022 KAKAO BLIND RECRUITMENT 주차 요금 계산 파이썬 풀이

2018 KAKAO BLIND RECRUITMENT 파일명 정리 파이썬

1 분 소요

2018 KAKAO BLIND RECRUITMENT 파일명 정리 파이썬 풀이