Page 1 of 1

Exclusion REX

Posted: Fri Nov 30, 2007 4:49 pm
by jgdoke
I have this exclustion:
ExpandSection\==[\digit]{1,2},

This page should fit the REX but is still being crawled.
http://www.rockwellautomation.com.ar/ap ... on=10%2C17
Can you explain a bit about this REX as it came from Thunderstone 3 years ago.. (thanks)

More examples of duplicate pages.
ExpandSection=11%2C33
ExpandSection=11%2C32
ExpandSection=11%2C3
AND
ExpandSection=27%2C37%2C1%2C13%2C18
ExpandSection=27%2C36%2C1%2C13%2C18
ExpandSection=27%2C35%2C1%2C13%2C18
ExpandSection=27%2C34%2C1%2C13%2C18

Exclusion REX

Posted: Fri Nov 30, 2007 5:46 pm
by mark
That means "ExpandSection=" followed by 1 through 2 digits followed by comma. Try

ExpandSection\==[\digit]{1,2}[,%]

to also catch the encoded commas.

Exclusion REX

Posted: Tue Dec 04, 2007 10:22 am
by jgdoke
that worked great but now there is this new format:

ExpandSection=-5%2C-8%2C-6%2C-9%2C-11

May I ask how to exclude these also?

Exclusion REX

Posted: Tue Dec 04, 2007 11:13 am
by mark
I'd need more examples to make a general expression. But to include - in what you have use
ExpandSection\==[\digit\-]{1,2}[,%]

Do you want any "ExpandSection" urls? If not, just use
ExpandSection\=