Data Science and Machine Learning Articles


Mountains

Create your experimental design with a simple Python command

Design of Experiment (DOE) is an important activity for any scientist, engineer, or statistician planning to conduct experimental analysis. This article discusses a new open-source Python package which can create key designs (factorial, randomized, response-surface, etc.) from a simple input CSV file.

Click here to read the article
Mountains

Why you should start using .npy file more often…

Data science needs fast computation and transformation of data. Native NumPy objects in Python provides that advantage over regular programming objects. It works for as simple a task as reading numeric data set from a file on the disk. We demonstrate the advantage with few lines of code.

Click here to read the article
Mountains

Introducing a simple and intuitive Python API for UCI machine learning repository

Introducing a simple and intuitive API for UCI machine learning portal, where users can easily look up a data set description, search for a particular data set they are interested, and even download datasets categorized by size or machine learning task.

Click here to read the article.
Mountains

How the good old sorting algorithm helps a great machine learning technique

In this article, we showed that how the simple sorting algorithm is at the heart of solving an important problem in computational geometry and how that relates to a widely used machine learning technique.

Click here to read the article.
Mountains

Some Essential Hacks and Tricks for Machine Learning with Python

In this article, we discuss some cool tricks and tips about machine learning with Python. Some key online MOOC courses and blogs are mentioned. Most essential machine learning libraries (NumPy, Matplotlib, Seaborn, Pandas, Scikit-Learn, Statsmodels, TensorFlow, etc.) and their features are mentioned.

Click here to read the article .
Mountains

Introducing pydbgen: A random dataframe/database table generator

Often, beginners in SQL or data science struggle with the matter of easy access to a large sample database file (.DB or .sqlite) for practicing SQL commands. Would it not be great to have a simple tool or library to generate a large database with multiple tables, filled with data of one’s own choice?

Click here to read the article.
Mountains

Data Analytics with Python by Web scraping: Illustration with CIA World Factbook

In this article, we show how to use Python libraries (URLlib, BeautifulSoup4, etc.), Regex programming, and HTML parsing to extract useful information from a website and answer some important analytics questions afterwards.

Click here to read the article .
Mountains

Step-by-step guide to build your own ‘mini IMDB’ database

In this article, I will show how to use simple Python libraries and built-in capabilities to scrape the web for movie information and store them in a local SQLite database, which can later be queried for data analytics with movie info. Think of this as a project to build your own mini IMDB database!

Click here to read the article.