Contexte

The Université Toulouse 1 Capitole propose a blacklist managed by Fabrice Prigent from many years, to help administrator to regulate Internet use. This database, often used in school, can be used with many commercial or free software.
Be careful : this list should not be seen as a "to be block". It must be seen as a "web categorization" : some categories can be blocked or allowed, depending on your environnement..

Licenses

Contrat Creative Commons
This creation is available under un Creative Commons Contract.

Description

Many categories are defined, but it's the main one is "pornography".
We are actively working on the adult database, but you can help me with the others one.
I add between 50 and 300 urls per day.
A archive file contain all category : blacklists.tar.gz
CategoryNumberDescription
adult4658610 Some adult site from erotic to hard pornography.
agressif396 Some aggressive sites.
ai74 Site which provides artificial intelligence
arjel69 ARJEL which is a french certification authority for gambling sites
associations_religieuses1 religious_association
astrology29 Astrology
audio-video3872 Some audio and video sites.
bank6646 Online bank
bitcoin1426 Sites for bitcoin mining
blog1500 Some blogs sites.
celebrity674 Famous people, actors, and magazine which talk about them
chat282 Chat site
child77 Any website allowed to child (less than 10 years old)
cleaning177 Sites to disinfect, update and protect computers.
cooking37 Sites for cooking
cryptojacking16289 Mining site by hijacking
dangerous_material54 Sites which describe how to make bomb and some dangerous material.
dating6378 Dating, matching site for single person
ddos421 DDoS or Stresser Sites
dialer4 Dialer Sites
doh3016 Site which provides DNS over HTTP service
download4035 Sites which propose to download software
drogue1067 Sites relative to drugs.
dynamic-dns2076 Site which provides dynamic-dns
educational_games11 educational games sites (flash and online games )
examen_pix347 A list reserved exclusively for French students taking the PIX exam. DO NOT USE in other circumstances
fakenews1094 Site which provides fakenews
filehosting943 Websites which host files (pictures, video, ...)
financial473 Sites relative financial information.
forums225 Forums site.
gambling32218 Gambling and games sites, casino, etc.
games35349 games sites (flash and online games )
hacking307 Hacking sites.
jobsearch429 Site to looking for job
lingerie171 Sites for lingerie
liste_bu2916 A french list for educational sites. VERY locally oriented. may help libraries.
malware273617 Any website which deliver malware
manga843 Any website related to manga, and cartoons
marketingware79 Very special marketing sites
mixed_adult156 Websites which contains adult sections unstructured
mobile-phone52 Sites for mobile phone (rings, etc).
phishing273594 Phishing sites (same as malware category)
press4645 Any press (informational) site
publicite4644 Advertisement.
radio577 Internet radio sites
reaffected8 Websites which have been reaffected
redirector132616 Some redirector sites, which are used to circumvent filtering.
remote-control171 site which allow remote control of user s dekstop
residential-proxies138 Site which provides residential-proxies
sect145 Sect
sexual_education20 Website which talk about sexual education, and can be misdetected as porn
shopping36971 Any shopping, selling center
shortener4541 URLs shortening sites
social_networks716 All social networks sites
sports2363 Sports
stalkerware525 Site which sells spying software for everybody
strict_redirector132344 Same as redirector, but with google, yahoo, and other cache/images search robots.
strong_redirector132344 Same as strict_redirector, but, for google, yahoo, we are only blocking some terms.
translation179 Sites for translation
tricheur73 Sites which are designed to explains cheating on exams.
tricheur_pix85 DO NOT USE. It s a specific blacklist for french exam.
update33 Update sites for software or OS
vpn6038 VPN site
warez1531 Warez sites.
webhosting25 Site which provides webhosting
webmail413 Webmail sites (hotmail like...)
These lists contain certainly some mistakes. If you find some, send me a mail : fabrice.prigent@ut-capitole.fr or you can use this interface https://dsi.ut-capitole.fr/cgi-bin/squidguard_modify.cgi.

How are these lists constituted

2 ways : Database is renewed more or less, twice a week.

Contributors

This database exists because many contributors help :

malware and marketingware

These bases are based on the work of :

Other database

Some database exist in other countries, but most of them have disappeared. Last ones :

How to download

Other informations