gigadata.html
HTML document, Unicode text, UTF-8 text
1<!DOCTYPE html> 2<html lang="en"> 3<head> 4<meta charset="UTF-8"> 5<title> 6Gigadata 7</title> 8<link rel="stylesheet" href="/static/style.css"> 9<meta name="viewport" content="width=device-width, initial-scale=1.0"> 10</head> 11<body> 12<header> 13<nav> 14<ul> 15<li><a href="/">Home</a></li> 16<li><a href="/projects">Projects</a></li> 17<li><a href="/index">Index</a></li> 18<li><a href="/about">About</a></li> 19<li><a href="https://roundabout-host.com/roundabout">Roundabout-host</a></li> 20</ul> 21<ul> 22<li><a href="mailto:root@roundabout-host.com" id="mail-link">root@roundabout-host.com</a></li> 23</ul> 24</nav> 25</header> 26<main> 27 28<div class="project-title"> 29<h1> 30Gigadata 31</h1> 32<a href="https://roundabout-host.com/roundabout/gigadata" class="repository-button">Go to repository</a> 33</div> 34<p class="tags"> 35 36<a href="/index/web.html" class="tag">web</a> 37 38<a href="/index/flask.html" class="tag">flask</a> 39 40<a href="/index/software.html" class="tag">software</a> 41 42<a href="/index/python.html" class="tag">python</a> 43 44<a href="/index/agpl.html" class="tag">agpl</a> 45 46<a href="/index/gigadata.html" class="tag">gigadata</a> 47 48<a href="/index/ai.html" class="tag">ai</a> 49 50<a href="/index/data.html" class="tag">data</a> 51 52<a href="/index/crowdsourcing.html" class="tag">crowdsourcing</a> 53 54<a href="/index/waste detection.html" class="tag">waste detection</a> 55 56<a href="/index/waste.html" class="tag">waste</a> 57 58</p> 59<article class="content-area"> 60<p>Gigadata is an image dataset collection and annotation platform. It allows anyone 61to easily contribute to the dataset by uploading images and annotating objects, 62and to use the dataset for training machine learning models. 63</p><p>The platform is designed to host a single huge dataset, which spans many classes, 64fields of interest, and use cases. Using querying it is easily possible to download 65only the parts that you need, though — for example, to get a JSON of the photos 66which contain either a cat or a dog (assuming these classes are registered on the 67server), and are under a PD equivalent licence: 68</p><pre data-language="yaml">want: 69- has: ["Domestic cat (Felis catus)", "Dog (Canis lupus familiaris)"] 70- nature: ["photo"] 71- licence: ["CC0-1.0", "X-public-domain", "X-informal-do-anything"] 72</pre><p>Classes are hierarchical, so many search problems are solved. For example, consider 73this hierarchy (excuse my text art): 74</p><pre data-language=""> /- Aluminium food container 75/- Aluminium household waste --- Aluminium can 76/- Metal household waste /- Plastic bag 77Household waste --- Plastic household waste --- Plastic bottle \ 78/ / \ 79Bottle ---------/---------------------------/ - PET bottle --- Clear PET bottle 80/ / / 81Plastic object ---- PET object --------------------------------/ / 82\-------------------------- Clear PET object / 83</pre><p>Multiple inheritance is also possible, seen here in Plastic bottle, for instance. 84It's both a Plastic household waste and a Bottle, and because it's a Plastic 85household waste, it's also a Household waste and a Plastic object. All sorts of 86hierarchies like this one are possible; the <code>has</code> filter is used to search for 87an object or its descendants. 88</p><p>There are more APIs, not just the search one. You can upload images, annotate 89them, and manage galleries programmatically. 90</p><p>If one wants more organisation of a certain set of images, a gallery feature is 91available where users can create galleries of images. Other users can also be 92assigned to add images to a gallery. 93</p><p>To prevent vandalism, you cannot change someone else's image annotations, but 94you can copy the image and make the changes; if the owner of the original approves, 95they can mark their version as obsolete and replaced by your version, which causes 96it to disappear from the search results. 97</p><p>The platform is made with Python, Flask and SQLAlchemy, just like the roundabout. 98</p><p>As always, this platform is free/libre under the AGPL. An official instance, Roundabout 99Datasets, is hosted at 100<a href="https://datasets.roundabout-host.com">datasets.roundabout-host.com</a>. Anyone can 101add images there, but they have to be free/libre. Nothing is guaranteed. 102</p><p>As far as I know, there's nothing else like this platform (at least not free/libre). 103<a href="mailto:root@roundabout-host.com">I'd be happy to be proven wrong, though</a>. 104</p><p>And why did I put "waste" in the topics? I'm moving the waste detection dataset 105there. 106</p> 107</article> 108 109</main> 110<footer> 111<p>Page generated on Sunday, 2 February 2025 at 20:57:22</p> 112<p xmlns:cc="http://creativecommons.org/ns#" >This work is marked with <a href="https://creativecommons.org/publicdomain/zero/1.0/?ref=chooser-v1" target="_blank" rel="license noopener noreferrer" style="display:inline-block;">CC0 1.0 Universal</a> (🄍). No rights reserved.</p> 113<p>Hosted at <a href="https://roundabout-host.com/roundabout">Roundabout-host</a> using the static site service, and generated with <a href="/projects/ampoule.html">Ampoule</a>.</p> 114<a href="#">Back to top</a> 115</footer> 116</body> 117</html>