performance impact of large /etc/hosts files

RSean rseans at gmail.com
Mon Dec 24 23:49:54 PST 2007


Hi guys,

Just curious if anyone has tried regular expressions to handle ads and
banners. 

We have a small network of about 10 users. We use SafeSquid as a proxy
and content filter. It supports regex for defining rules.

The URL Filter section has two default rules for blocking ads and banners:

Hosts: 
(^ad(|s|v|server)\.|adtag\.|targetsearches.com|webconnect.net|imgis.com|atwola.com|fastclick.net|abz.com|tribalfusion.com|advertising.com|atdmt.com|spinbox\.(com|net)|linkexchange.com|hitbox.com|doubleclick.net|valueclick.com|click2net.com|mediaplex.com|247media.com|clickagents.com|adbutler.com|qkimg.net|realmedia.com|us.a1.yimg.com|clickheretofind.com|images.cybereps.com|adbureau.net|sfads.osdn.com|adflow.com|adprofs.com|zedo.com|digitalmedianet.com|ad-flow.com|/adsync/|adtech.de|netdirect.nl|rcm-images.amazon.com|pamedia.com|msads.net|valuead.com|smartadserver.com|thisbanner.com|aaddzz.com|scripps.com|ru4.com|adtrix.net|falkag.net)

File:
(/adimages/|/banner(|s)/|/ad(|s|v|(|_)banner(|s))/|/adx/|/sponsors/|/advert(ising|s|)/|/adcycle/|/track/|/promo/|/adspace/|/admentor/|/image\.ng/|/ajrotator/|/adview.php|/clickthru|/affiliates|banmat(\.cgi|.\.cgi)|/adproof/|/bannerfarm/|/BannerAds/|/banner_|sponsorid|/servfu.pl|/RealMedia/|/adsync/|_ad_|/adceptdelivery.cgi)

I am not a very technical person, but as far as I can tell the first rule
is a regex matching the hostnames of servers that deliver ads, while the
second is a regex matching strings that may appear in the file (path)
part of a URL.
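As a rough sketch of how two rules like these could be applied, here is a
small Python example. It is not SafeSquid's implementation: the function
name is_blocked, the shortened pattern lists, and the choice of an
unanchored search against the host and the path are all assumptions made
for illustration.

```python
import re
from urllib.parse import urlsplit

# Shortened subsets of the two rules quoted above (illustrative only).
# Dots are left unescaped where the original rules leave them unescaped,
# so "." there matches any single character.
HOST_RULE = re.compile(
    r"(^ad(|s|v|server)\.|adtag\.|doubleclick.net|fastclick.net|spinbox\.(com|net))"
)
FILE_RULE = re.compile(
    r"(/adimages/|/banner(|s)/|/ad(|s|v|(|_)banner(|s))/|/sponsors/|/advert(ising|s|)/|_ad_)"
)


def is_blocked(url):
    """Return True if the host or the file (path) part of url matches a rule.

    Hypothetical helper: assumes each rule is searched for anywhere in the
    string it is applied to, unless the pattern itself anchors with ^.
    """
    parts = urlsplit(url)
    host = parts.hostname or ""
    return bool(HOST_RULE.search(host) or FILE_RULE.search(parts.path))


if __name__ == "__main__":
    for url in (
        "http://ads.example.com/index.html",
        "http://www.example.com/banners/top.gif",
        "http://www.freebsd.org/doc/handbook/",
    ):
        print(url, "blocked" if is_blocked(url) else "allowed")
```

Under these assumptions the first URL is caught by the host rule, the
second by the file rule, and the third passes through.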

These rules block ads and banners very efficiently at the gateway, saving
bandwidth and improving the browsing experience.

Just thought I should mention this.

Cheers!
-- 
View this message in context: http://www.nabble.com/performance-impact-of-large--etc-hosts-files-tp14267018p14493715.html
Sent from the freebsd-questions mailing list archive at Nabble.com.


