Posts

Showing posts from November, 2020

RECOGNIZING HANDWRITTEN DIGITS

Image
RECOGNIZING HANDWRITTEN DIGITS WITH  SCIKIT-LEARN Hypothesis to be tested : The Digits data set of scikit-learn library provides numerous data-sets that are useful for testing many problems of data analysis and prediction of the results. Some Scientist claims that it predicts the digit accurately 95% of the times. Libraries : scikit-learn , matplotlib INSTALL LIBRARIES We can analyse image as well as sound using data analysis too. Here in this project we are going to analyse the images. For any data analysis project we are using jupyter notebook as an editor same as here. Now by using pip install sklearn and matplotlib. from sklearn import datasets,svm svc = svm.SVC(gamma=0.001, C=100.) digits = datasets.load_digits() Now for getting information about the digits dataset use following command and read the description given: print(digits.DESCR) After hitting above command you will get the desciption as: .. _digits_dataset: Optical recognition of handwritten digits dataset --------------

GLOBAL WARMINGS EFFECT ON TEMPERATURE AND HUMIDITY

Image
PERFORMING ANALYSIS OF METEOROLOGICAL DATA Hypothesis to be tested : The Influence of Global Warming on temperature and humidity. Libraries:Pandas, scikit-learn , matplotlib INSTALL LIBRARIES For any data analysis project we are using jupyter notebook as a editor. By using pip install numpy, pandas, matplotlib and seaborn. Now start with follwing code: import pandas as pd import numpy as np %matplotlib inline import matplotlib.pyplot as plt LOAD DATA For this project we are using a csv file which is available there on kaggle platform. Now import this csv file in the project. You can get this csv file on the link below: Dataset df = pd.read_csv('weatherHistory.csv') df.head() CHECKING SHAPE, DTYPE & NULL VALUES For analysing any dataset we first need to check dataset size,shape and is there any null values present, if yes then we have to clean it but here we are getting a clean dataset. df.shape df.dtypes df.isnull().sum() FORMATING DATE For our hypothesis we need to manage