MeanNearestNeighbors (MNN) - algorithm for balancing dataset - In progress #1

Image
One of the challenges in classification problems are unbalanced datasets. I was Data Science Intern when the company that I worked for, assigned me such an interesting challenge where the dataset was unbalanced.  However, I realized this type of problem like unbalanced dataset is а common thing in real life. I tried most of the algorithms (undersampling, oversampling) like SMOTE, NearMiss, CondensedNearestNeighbors, RandomUnderSampler, RandomOverSampler,  KMeansSMOTЕ and rest of them. Anyway, they didn't help me in that case, on the contrary, they worsened my model.  I was like: "but, but, you should have been helpful in creating the predictive model" So, I'm trying to create another algorithm based on undersampling concept when it comes to balancing datasets. I called it Mean Nearest Neighbors (MNN). What's the initial idea: It's simple. Actually, the algorithm is just a modification of the other undersampling algorithms. In the data where target labe...

Technology Things #3: What are some mind-blowing facts related to technology?


  • The world wide web was created using NEXT computer, which does not exist now.
  • Google hired a camel to create street view in desert.
  • Sophia is the first ever robot to get a citizenship of Saudi Arabia. She was provided citizenship in October, 2017.
  • If Facebook was a country, then it would have been the world's most populated country.
  • One Bitcoin transaction consumes 3,994 times more energy than a credit card transaction.
  • Candy Crush has more active monthly users than the entire population of Canada.
  • Apple iPad would cost $14,970 (₹ 9,69,510) if made in the USA.
  • Microsoft makes more money from Samsung's Android than they do from Windows phones, Skype and Xbox combined.
  • As per the statistic of 2016, 54,907 Google searches, 7,252 tweets, 125,406 YouTube video views and 2,501,018 E-mails are sent per second in Internet.
  • South Korea had the fastest average internet connection in the world.

Popular posts from this blog

Math Problem -> Combinatorics: Foreign alphabet

Competitive Programming #29 : [LineUp]

Intro to Quantum Computing: Што ќе ми треба ова сега? #1