September 14 Changes (HHH):
---------------------------

1. Action: Allow all "analo" words at start to pass
   From:   BadURL_WordStarts[i++]="anal[^y]";
   To:     BadURL_WordStarts[i++]="anal[^oy]";
   Reason: To prevent false positives like analog, analogy, etc.

2. Action: Downgrade "pix" from all of URL to just HOST
   From:   BadURL_Parts[i++] = "pix";
   To:     BadHostParts[i++] = "pix";
   Reason: Just some of the false positives detected by this
           pattern are:

             g-images.amazon.com/images/G/01/x-locale/common/\
               transparent-pixel.gif
             regnow.img.digitalriver.com/images/pixel.gif
             www.inkjetsuperstore.com/site/img/misc/pix.gif
             www.inkjetsuperstore.com/s.nl/sc.2/category.42/site/\
               img/misc/pix.gif
             www.inkjetsuperstore.com/site/img/misc/pix.gif
             www.inkjetsuperstore.com/s.nl/sc.2/category.1742/site/\
               img/misc/pix.gif
             www.nextag.com/imagefiles/style/gray_pixel.gif

           NOT using "pix" on hosts: 570 / 320 / 267 (-250, -53)
               USING "pix" on hosts: 570 / 0 / 0 (-570, -0)

           We will have to live with false positives with hosts
           UNTIL other rules can take up the slack, IF they can
           take up the slack.

3. Action: NO CHANGE
   From:   BadURL_WordEnds[i++]="[^sc]hard
   To:     BadURL_WordEnds[i++]="[^cs]hard
   Reason: put in alphabetical order for nitpickers

4. Action: NO CHANGE
   From:   ... is the breackets [] (line 56)
           // GoodDomains at the end
           // BadDomains at the end
           // BadHostParts okay anywhere
           // BadHostWordStarts at the begining
           // BadHostWordEnds at the end
           //
           // BadURL_Parts okay anywhere
           // BadURL_WordStarts at the begining
           // BadURL_WordEnds at the end
   To:     ... is the brackets [] (line 56)
           // GoodDomains at the end
           // BadDomains at the end
           //
           // BadURL_Parts okay anywhere
           // BadHostParts okay anywhere
           //
           // BadURL_WordStarts at the begining
           // BadHostWordStarts at the begining
           //
           // BadURL_WordEnds at the end
           // BadHostWordEnds at the end
   Reason: To match where things are at in the file.

           On this topic, you CAN use NotePad if you want, but I
           advise getting a REAL editor that can be set to be
           CASE SENSITIVE.
           Make all names in the comments header MATCH *EXACTLY*
           what they are down below.

           You don't have an editor? I use both GVIM and
           MicroEMACS. Pick your own poison.

           GVIM: http://www.vim.org/
                 http://www.vim.org/download.php#pc
                 (I would suggest the gvim70.exe)

5. Action: NO CHANGE
   From:   No comments by arrays
   To:     REGEXP warnings by all affected arrays
   Reason: To prevent somebody from putting a REGEXP in the WRONG
           place.
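
A rough sketch of what changes 1 and 2 do in practice. This is NOT the
filter's real matching code: the helper names (matchesWordStart, hostOf,
matchesPart), the way a "word start" is detected, and the example.com URLs
are all assumptions made up for illustration. Only the pattern strings and
the nextag.com URL come from the entries above.

```javascript
// ASSUMPTION: a "word start" pattern counts as a hit when it follows the
// beginning of the string or any non-letter character. The real file may
// anchor its WordStarts arrays differently.
function matchesWordStart(pattern, url) {
  return new RegExp("(^|[^a-z])" + pattern, "i").test(url);
}

// ASSUMPTION: URLs arrive without a scheme, as in the list under change 2,
// so the host is everything before the first slash.
function hostOf(url) {
  const slash = url.indexOf("/");
  return slash === -1 ? url : url.substring(0, slash);
}

// A "parts" pattern is allowed to match anywhere in the text it is given.
function matchesPart(pattern, text) {
  return new RegExp(pattern, "i").test(text);
}

// Change 1: the old pattern trips on "analog", the new one lets it pass.
const analogUrl = "www.example.com/analog-clocks.html"; // hypothetical URL
console.log(matchesWordStart("anal[^y]", analogUrl));   // true  (false positive)
console.log(matchesWordStart("anal[^oy]", analogUrl));  // false (passes)

// Change 2: "pix" against the whole URL vs. against the host only.
const pixelUrl = "www.nextag.com/imagefiles/style/gray_pixel.gif";
console.log(matchesPart("pix", pixelUrl));              // true  (false positive)
console.log(matchesPart("pix", hostOf(pixelUrl)));      // false (passes)
```

A host that itself contains "pix" (say, the made-up pix.example.com) is
still caught after change 2, which is the trade-off the Reason under
change 2 describes.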