DA

Kidney Stones & Simpson's Paradox

A new look at an old research study. In 1986, a group of urologists in London published a research paper in The British Medical Journal that compared the effectiveness of two different methods of removing kidney stones. Treatment A was open surgery (invasive), and treatment B was percutaneous nephrolithotomy (less invasive). When they looked at the results from all 700 patients, treatment B had a higher success rate. However, when they looked separately at subgroups of patients with different kidney stone sizes, treatment A had a better success rate.
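The reversal is easier to see with the arithmetic written out. The sketch below is only an illustration: the per-group counts are the figures commonly cited for the 1986 study and do not appear in the summary above, so treat them as an assumption. The helper names are made up for this example.

```python
# (successes, patients) per treatment and stone size.
# Assumption: counts commonly cited for the 1986 study; they are not
# given in the summary above. They add up to the 700 patients mentioned.
results = {
    "A": {"small": (81, 87), "large": (192, 263)},
    "B": {"small": (234, 270), "large": (55, 80)},
}


def success_rate(successes, patients):
    """Fraction of patients whose treatment succeeded."""
    return successes / patients


def overall_rate(groups):
    """Success rate after pooling every stone-size group together."""
    total_successes = sum(s for s, _ in groups.values())
    total_patients = sum(n for _, n in groups.values())
    return success_rate(total_successes, total_patients)


for treatment, groups in results.items():
    for size, (successes, patients) in groups.items():
        rate = success_rate(successes, patients)
        print(f"Treatment {treatment}, {size} stones: {rate:.0%}")
    print(f"Treatment {treatment}, all patients: {overall_rate(groups):.0%}")
```

With these counts, treatment A comes out ahead within each stone-size group while treatment B comes out ahead once the groups are pooled. That happens because most of treatment A's patients had large stones, which are harder to treat, so the pooled rates compare unequal mixes of cases.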

Investigating Fandango Movie Ratings

In October 2015, a data journalist named Walt Hickey analyzed movie ratings data and found strong evidence to suggest that Fandango’s rating system was biased and dishonest. He published his analysis in this article, a great piece of data journalism that’s totally worth reading. Fandango displays a 5-star rating system on their website, where the minimum rating is 0 stars and the maximum is 5 stars. Hickey found a significant discrepancy between the number of stars displayed to users and the actual rating, which he was able to find in the HTML of the page.

Analyzing NYC High School Data

One of the most controversial issues in the U.S. educational system is the efficacy of standardized tests, and whether they’re unfair to certain groups. Investigating the correlation between SAT scores and demographics might be an interesting angle to take. We could correlate SAT scores with factors like race, gender, income, and more. The SAT, or Scholastic Aptitude Test, is an exam that U.S. high school students take before applying to college.

Demand Forecasting of Perishable Products

The objective of this project is to minimize wastage of meal kits in retail stores. Currently, this is done by tracking each individual item from the source to the point of sale, which is a cumbersome and labor-intensive process. To achieve the objective with machine learning, the first step is to have an accurate forecast of the demand. This project focuses on generating an accurate forecast for each individual item (46 unique items) in each store (47 unique stores).

Exploratory Analysis of Hacker News Posts

Hacker News is a social news website focusing on computer science and entrepreneurship. It was started by the startup incubator Y Combinator, and its posts are voted and commented on, similar to Reddit. Posts that make it to the top of the Hacker News listings get more visitors as a result. In this project, we are interested in the posts that begin with either Ask HN or Show HN. Users submit posts with the “Ask HN” prefix to ask the Hacker News community specific questions.
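As a rough sketch of that first filtering step, posts can be grouped by the prefix of their title. The function name and the sample titles below are hypothetical and are not taken from the project's dataset.

```python
def classify_post(title):
    """Return 'ask', 'show', or 'other' depending on the title prefix."""
    lowered = title.lower()  # compare case-insensitively
    if lowered.startswith("ask hn"):
        return "ask"
    if lowered.startswith("show hn"):
        return "show"
    return "other"


# Hypothetical titles, used only to illustrate the grouping.
sample_titles = [
    "Ask HN: How do you learn a new codebase?",
    "Show HN: A tiny static site generator",
    "A history of the spreadsheet",
]

counts = {"ask": 0, "show": 0, "other": 0}
for title in sample_titles:
    counts[classify_post(title)] += 1

print(counts)
```

Lowercasing the title first is a design choice here, made so that variants such as "ask hn" are not missed.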

Human Variability in Computer Input Device

Computers are used ubiquitously in many jobs, and even at home, to facilitate various tasks. Operation devices are used to transfer information to the machine and to adjust or change its state [1]. Most interaction with a computer involves using either a mouse or a keyboard. People often maintain a static, unnatural posture for long hours. Using the mouse involves repetitive movements, which may cause physiological problems in the arm, wrist, and shoulder [2].

Mobile App for Lottery Addiction

Many people start playing the lottery for fun, but for some this activity turns into a habit that eventually escalates into addiction. Like other compulsive gamblers, lottery addicts soon begin spending from their savings and loans, start to accumulate debts, and eventually engage in desperate behaviors like theft. In this project, we are going to contribute to the development of a mobile app by writing a couple of functions that are mostly focused on calculating probabilities.
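To give a flavor of the kind of function involved, here is a minimal sketch. It assumes a lottery in which 6 numbers are drawn without replacement from a pool of 49; that format, along with the function and parameter names, is an illustrative assumption rather than something stated in the project description.

```python
from math import comb


def one_ticket_probability(numbers_drawn=6, pool_size=49):
    """Chance that a single ticket matches every drawn number.

    Assumes a 6-from-49 style lottery (an illustrative default).
    """
    return 1 / comb(pool_size, numbers_drawn)


def multi_ticket_probability(n_tickets, numbers_drawn=6, pool_size=49):
    """Chance of winning when holding n_tickets distinct tickets."""
    total_outcomes = comb(pool_size, numbers_drawn)
    if not 0 <= n_tickets <= total_outcomes:
        raise ValueError("n_tickets must be between 0 and the number of outcomes")
    return n_tickets / total_outcomes


print(f"Possible outcomes: {comb(49, 6):,}")
print(f"One ticket: {one_ticket_probability():.10f}")
print(f"100 distinct tickets: {multi_ticket_probability(100):.10f}")
```

Under that assumption there are 13,983,816 possible outcomes, which makes concrete how small the chance of winning is even with many tickets.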

Turning Tweets into Knowledge

Sentiment analysis is about detecting people’s emotions and opinions about certain topics by analyzing the text they write, such as tweets, Facebook comments or statuses, and YouTube comments. Retailers and companies in general use sentiment analysis to get an overview of their clients’ opinions on their products, which enables them to improve and modify those products so that they meet their clients’ standards. Many social media sites, such as Google Plus, Facebook, and Twitter, allow users to express opinions, views, and emotions about certain topics and events.

AirBnB: Nearest Neighbors

AirBnB is a marketplace for short-term rentals that allows you to list part or all of your living space for others to rent. You can rent everything from a room in an apartment to your entire house on AirBnB. Because most of the listings are on a short-term basis, AirBnB has grown to become a popular alternative to hotels. The company itself has grown from its founding in 2008 to a 30 billion dollar valuation in 2016 and is currently worth more than any hotel chain in the world.

Star Wars: A data exploration

Before the release of “Star Wars: The Force Awakens”, the team at FiveThirtyEight wanted to answer some questions about the Star Wars franchise. In particular, they were interested in answering the question: which movie is the best in the franchise? The team needed to collect data addressing this question. To do this, they surveyed Star Wars fans using the online tool SurveyMonkey. They received 835 total responses, which you can download from their GitHub repository.