
Working with Real Data

by 빙글빙글이 2021. 1. 18.

When learning machine learning, it is best to experiment with real data.

The publicly available data sources linked below are good places to start.

UC Irvine Machine Learning Repository

archive.ics.uci.edu/ml/index.php

 


Kaggle Datasets

www.kaggle.com/datasets

 


Amazon AWS Datasets

registry.opendata.aws/

 


Data Portals

dataportals.org/search

 


Open Data Monitor

www.opendatamonitor.eu/

 


Quandl

www.quandl.com/

 


Wikipedia's list of machine learning datasets

en.wikipedia.org/wiki/List_of_datasets_for_machine-learning_research

 


Quora

www.quora.com/Where-can-I-find-large-datasets-open-to-the-public

 


Datasets subreddit

www.reddit.com/r/datasets/

 


GitHub

github.com/awesomedata/awesome-public-datasets

 


Korean public data

www.data.go.kr/

 

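Many of the sources above offer data as CSV files, so a quick first look at a downloaded file might go something like the sketch below. It uses only the Python standard library. The sample data, its column names, and the `summarize_csv` helper are all made up for illustration and do not come from any of the listed sites.

```python
import csv
import io
from collections import Counter

# Hypothetical sample standing in for a downloaded CSV file;
# the column names and values are invented for illustration.
SAMPLE_CSV = """sepal_length,sepal_width,species
5.1,3.5,setosa
4.9,,setosa
6.3,3.3,virginica
"""


def summarize_csv(text_stream):
    """Return (row_count, column_names, missing_counts) for a CSV text stream."""
    reader = csv.DictReader(text_stream)
    missing = Counter()
    row_count = 0
    for row in reader:
        row_count += 1
        for name, value in row.items():
            # Count empty or absent cells as missing values.
            if value is None or (isinstance(value, str) and not value.strip()):
                missing[name] += 1
    return row_count, reader.fieldnames, dict(missing)


rows, columns, missing = summarize_csv(io.StringIO(SAMPLE_CSV))
print(rows, columns, missing)
```

For a real download, the same helper could be given a file opened with `open(path, newline="", encoding="utf-8")`, where `path` is wherever the file was saved. The right encoding depends on the dataset, so UTF-8 here is an assumption.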

 
