Learn data science and Machine Learning

@datasciencefun

Гео и язык канала: не указан, не указан

Категория: не указана

Join this channel to learn data science and machine learning with funny quizzes and amazing resources for free
If possible, share this channel link with your friends and family members who want to learn data science.
http://t.me/datasciencefun

Связанные каналы | Похожие каналы

Гео и язык канала

не указан, не указан

Категория

не указана

Статистика

Избранное

Фильтр публикаций

Скрывать удаленные

Скрывать репосты

Learn data science and Machine Learning

13 Mar 2021, 08:18

You will prefer YouTube videos in which language?

Опрос

Hindi
English
Mix of both

767 голосов

1.6k 1 1

Learn data science and Machine Learning

11 Mar 2021, 23:53

Rules of Machine Learning.pdf

449.5Кб

970 0 22

Learn data science and Machine Learning

9 Mar 2021, 20:59

Perfect resources for beginners
https://www.kaggle.com/kanncaa1/data-sciencetutorial-for-beginners

Data ScienceTutorial for Beginners

Explore and run machine learning code with Kaggle Notebooks | Using data from Pokemon- Weedle's Cave

1.4k 0 22

Learn data science and Machine Learning

26 Feb 2021, 08:38

🔰Data Science [All Courses] 🔰

🌀Source : Udacity
🌀Size : 54.05 GB

🔗Link:
https://mega.nz/#F!qrpxSIRD!PClG5ZMHdd5FroIFTT_Z5Q

💢 Share and Support Us 💢

2.4k 0 84

Learn data science and Machine Learning

23 Feb 2021, 23:29

https://github.com/datastacktv/data-engineer-roadmap

datastacktv/data-engineer-roadmap

Roadmap to becoming a data engineer in 2021. Contribute to datastacktv/data-engineer-roadmap development by creating an account on GitHub.

2.3k 0 17

Learn data science and Machine Learning

12 Feb 2021, 14:21

https://towardsdatascience.com/my-journey-into-data-science-39e9bbbbf452

My Journey Into Data Science

Quite a number of people have asked me about my switch from Chemical Engineering to Data Science. How did I do it? When did I do it? Why…

2.6k 0 10

Learn data science and Machine Learning

8 Feb 2021, 18:02

https://towardsdatascience.com/you-should-master-data-analytics-first-before-becoming-a-data-scientist-5dbceaea9d3d

You Should Master Data Analytics First Before Becoming a Data Scientist

Here are 4 reasons why…

2.5k 0 8

Learn data science and Machine Learning

6 Feb 2021, 09:15

2.1k 0 16

Learn data science and Machine Learning

6 Feb 2021, 07:51

SQL for Data Science.pdf.pdf

1.6Мб

SQL for Data Science

1.7k 0 54

Learn data science and Machine Learning

5 Feb 2021, 14:34

Intershala Machine learning

https://drive.google.com/folderview?id=1tZFMiQ-ZD9rZ4epFSd4lVCCuwqhQt0_9

1.7k 0 56

Learn data science and Machine Learning

2 Feb 2021, 08:48

What do you want to learn?

Опрос

Data science from scratch
Machine Learning and it's algorithms from scratch
Projects on machine learning
Projects on data analysis and data science

419 голосов

2k 0 1

Learn data science and Machine Learning

2 Feb 2021, 08:44

Logistic regression fits a logistic model to data and makes predictions about the probability of an event (between 0 and 1).

Naive Bayes uses Bayes Theorem to model the conditional relationship of each attribute to the class variable.

The k-Nearest Neighbor (kNN) method makes predictions by locating similar cases to a given data instance (using a similarity function) and returning the average or majority of the most similar data instances. The kNN algorithm can be used for classification or regression.

Classification and Regression Trees (CART) are constructed from a dataset by making splits that best separate the data for the classes or predictions being made. The CART algorithm can be used for classification or regression.

Support Vector Machines (SVM) are a method that uses points in a transformed problem space that best separate classes into two groups. Classification for multiple classes is supported by a one-vs-all method. SVM also supports regression by modeling the function with a minimum amount of allowable error.

1.8k 0 27

Learn data science and Machine Learning

2 Feb 2021, 08:26

Three different learning styles in machine learning algorithms:

1. Supervised Learning

Input data is called training data and has a known label or result such as spam/not-spam or a stock price at a time.

A model is prepared through a training process in which it is required to make predictions and is corrected when those predictions are wrong. The training process continues until the model achieves a desired level of accuracy on the training data.

Example problems are classification and regression.

Example algorithms include: Logistic Regression and the Back Propagation Neural Network.

2. Unsupervised Learning

Input data is not labeled and does not have a known result.

A model is prepared by deducing structures present in the input data. This may be to extract general rules. It may be through a mathematical process to systematically reduce redundancy, or it may be to organize data by similarity.

Example problems are clustering, dimensionality reduction and association rule learning.

Example algorithms include: the Apriori algorithm and K-Means.

3. Semi-Supervised Learning

Input data is a mixture of labeled and unlabelled examples.

There is a desired prediction problem but the model must learn the structures to organize the data as well as make predictions.

Example problems are classification and regression.

Example algorithms are extensions to other flexible methods that make assumptions about how to model the unlabeled data.

1.5k 0 30

Learn data science and Machine Learning

2 Feb 2021, 08:21

1.4k 0 21

Learn data science and Machine Learning

30 Jan 2021, 23:26

What is data munging?

Опрос

finding out more insights from the data
cleaning the data and playing with it to make it better suit statistical modeling
running the actual algorithms to predict outcome

274 голосов

1.4k 0 5

Learn data science and Machine Learning

30 Jan 2021, 23:22

Some useful PYTHON libraries for data science

NumPy stands for Numerical Python. The most powerful feature of NumPy is n-dimensional array. This library also contains basic linear algebra functions, Fourier transforms, advanced random number capabilities and tools for integration with other low level languages like Fortran, C and C++

SciPy stands for Scientific Python. SciPy is built on NumPy. It is one of the most useful library for variety of high level science and engineering modules like discrete Fourier transform, Linear Algebra, Optimization and Sparse matrices.

Matplotlib for plotting vast variety of graphs, starting from histograms to line plots to heat plots.. You can use Pylab feature in ipython notebook (ipython notebook –pylab = inline) to use these plotting features inline. If you ignore the inline option, then pylab converts ipython environment to an environment, very similar to Matlab. You can also use Latex commands to add math to your plot.

Pandas for structured data operations and manipulations. It is extensively used for data munging and preparation. Pandas were added relatively recently to Python and have been instrumental in boosting Python’s usage in data scientist community.

Scikit Learn for machine learning. Built on NumPy, SciPy and matplotlib, this library contains a lot of efficient tools for machine learning and statistical modeling including classification, regression, clustering and dimensionality reduction.

Statsmodels for statistical modeling. Statsmodels is a Python module that allows users to explore data, estimate statistical models, and perform statistical tests. An extensive list of descriptive statistics, statistical tests, plotting functions, and result statistics are available for different types of data and each estimator.

Seaborn for statistical data visualization. Seaborn is a library for making attractive and informative statistical graphics in Python. It is based on matplotlib. Seaborn aims to make visualization a central part of exploring and understanding data.

Bokeh for creating interactive plots, dashboards and data applications on modern web-browsers. It empowers the user to generate elegant and concise graphics in the style of D3.js. Moreover, it has the capability of high-performance interactivity over very large or streaming datasets.

Blaze for extending the capability of Numpy and Pandas to distributed and streaming datasets. It can be used to access data from a multitude of sources including Bcolz, MongoDB, SQLAlchemy, Apache Spark, PyTables, etc. Together with Bokeh, Blaze can act as a very powerful tool for creating effective visualizations and dashboards on huge chunks of data.

Scrapy for web crawling. It is a very useful framework for getting specific patterns of data. It has the capability to start at a website home url and then dig through web-pages within the website to gather information.

SymPy for symbolic computation. It has wide-ranging capabilities from basic symbolic arithmetic to calculus, algebra, discrete mathematics and quantum physics. Another useful feature is the capability of formatting the result of the computations as LaTeX code.

Requests for accessing the web. It works similar to the the standard python library urllib2 but is much easier to code. You will find subtle differences with urllib2 but for beginners, Requests might be more convenient.

Additional libraries, you might need:

os for Operating system and file operations

networkx and igraph for graph based data manipulations

regular expressions for finding patterns in text data

BeautifulSoup for scrapping web. It is inferior to Scrapy as it will extract information from just a single webpage in a run.

1.5k 0 41

Learn data science and Machine Learning

21 Jan 2021, 18:28

Which library is used to implement machine learning algorithms on datasets?

Опрос

Numpy
Scikit-learn

528 голосов

2.2k 0 5

Learn data science and Machine Learning

21 Jan 2021, 18:26

Which library provides methods required for linear algebra, matrix manipulations and Fourier transformation?

Опрос

Pandas
Numpy
Matplotlib
Scikit-learn

499 голосов

2k 0 5

Learn data science and Machine Learning

21 Jan 2021, 18:24

Seaborn and Matplotlib are used for?

Опрос

Data extraction
Data visualization
Training a model

491 голосов

1.8k 0 5

Learn data science and Machine Learning

21 Jan 2021, 18:20

Which library in Python can be used to gather data?

Опрос

Beautiful Soup
Numpy
Matplotlib
Scikit-learn

502 голосов

1.7k 0 7

Показано 20 последних публикаций.

2 308

подписчиков

Статистика канала

Язык сайта

Большой курс от TGStat Academy

Новогодний квиз от TGStat Academy

Кот научит как «ловить рыбу»

Learn data science and Machine Learning

Гео и язык канала

Категория

2 308