Appsilon at Hack4Enviornment Hackathon – Solving Illegal Waste Disposal Problems with Machine Learning

[This article was first published on r – Appsilon | End­ to­ End Data Science Solutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Gift bag we received from the organizers, including the prize our team won.

Appsilon at Hack4Environment

Appsilon has recently taken part in the Hack4Environment – a 24-hour hackathon focusing on dealing with the critical problem of illegal waste disposal (known as fly-tipping). 

Read about our AI4Good initiative – Here’s how Machine Learning can be used to analyze wildlife camera trap datasets.

Long story short – in 24 hours, we proposed a solution that granted us an honorable mention in the competition!

Navigate to a section:

About the Hackathon

As said previously, the hackathon’s goal was to somehow deal with the problem of illegal waste disposal. Our solution was based on two parts:

  1. Application portion – managed by Dominik Krzemiński
  2. Machine learning portion – managed by Jędrzej Świeżewski

We ended up proposing a solution from three different perspectives, which are discussed in the following section.

Our Solution

Our idea was to build an app addressing the issue from the following perspectives:

  • Citizen perspective – citizens could localize sites in their neighborhood with disposed waste and earn recognition for reporting on cleaning it up.
  • Local authority’s perspective – we provide them with insights about the waste’s location on a larger scale, allowing them to direct cleaning efforts.
  • Manufacturer perspective – we build a mechanism for incentivizing the manufacturers of goods that get thrown out as the waste to act on the information that their products are perceived as littering the local communities.

As icing on the cake, we have built and deployed a Shiny application called Kropla. It aims to display the data provided by the hackathon organizers clustered into localized groups of waste items. We have added a mechanism for claiming waste removal and receiving recognition for it. 

In addition, we’ve also trained a computer vision model to detect the brand of a given waste item. The model’s validation accuracy reached 75% on a dataset with 46 classes.

If you have a couple of minutes to spare, please watch our hackathon submission video below. It’s in Polish, so apologies to our non-Polish readers:


Lessons Learned

At the end of the day, there are four main takeaways for us from this hackathon:

  1. Hackathons are a great occasion to bond within the team. Somehow, a chat late at night serves this purpose best.
  2. Agglomerative clustering is perfect for dealing with clustering on huge scale differences. The data we dealt with spanned the globe, while it, of course, makes no sense for a fly-tipping site to be of a size larger than a few tens of meters. Many other clustering algorithms (including the common HDBSCAN) struggle with such data leading to, e.g., entire cities becoming a cluster.
  3. The 75% accuracy reached by our machine learning model does not seem to stand out. When one considers the fact it was an accuracy over 46 classes, it starts feeling very good. However, this is misleading as the data set was heavily imbalanced, with one brand taking up around half of the data. This means a majority-class proxy would have ~50% accuracy regardless of the multitude of classes. Our model’s key strength lies in the fact that it dealt well with some of the tiny classes (for some, the brand was hard to recognize even for a human eye). The takeaway here is that reporting a model’s performance with a single number is typically not sufficient.
  4. Our team had two participants, while each of the three teams that ranked above us had more (3, 5, and 5) – having more people on board, especially in a limited time setting, can be a differentiating factor.

Conclusion

All in all, the hackathon was a lot of fun! Moreover, it touched a globally and locally important ecological issue and helped raise the awareness of waste reduction and fly-tipping.

We would like to congratulate the other winners, thank the organizers and applaud all the participants for their effort. We regret that we could not gather in person due to the circumstances and mingle with all the people involved in the event to get to know each other beyond the chat communication.

Learn More

Appsilon is hiring for remote roles! See our Careers page for all open positions, including R Shiny DevelopersFullstack EngineersFrontend Engineers, a Senior Infrastructure Engineer, and a Community Manager. Join Appsilon and work on groundbreaking projects with the world’s most influential Fortune 500 companies.

Article Appsilon at Hack4Enviornment Hackathon – Solving Illegal Waste Disposal Problems with Machine Learning comes from Appsilon | End­ to­ End Data Science Solutions.

To leave a comment for the author, please follow the link and comment on their blog: r – Appsilon | End­ to­ End Data Science Solutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)