Exclusion REX

jgdoke
Posts: 167
Joined: Wed Jul 14, 2004 10:52 am

Post by jgdoke »

I have this exclusion:
ExpandSection\==[\digit]{1,2},

This page should fit the REX but is still being crawled.
http://www.rockwellautomation.com.ar/ap ... on=10%2C17
Can you explain a bit about this REX? It came from Thunderstone 3 years ago. (Thanks)

More examples of duplicate pages.
ExpandSection=11%2C33
ExpandSection=11%2C32
ExpandSection=11%2C3
AND
ExpandSection=27%2C37%2C1%2C13%2C18
ExpandSection=27%2C36%2C1%2C13%2C18
ExpandSection=27%2C35%2C1%2C13%2C18
ExpandSection=27%2C34%2C1%2C13%2C18
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

Post by mark »

That means "ExpandSection=" followed by one or two digits, followed by a literal comma. Your URLs encode the comma as %2C, so the expression never matches them. Try

ExpandSection\==[\digit]{1,2}[,%]

to also catch the encoded commas.
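
To see why the first expression misses the encoded URLs, here is a rough Python sketch. It is not REX: it assumes REX `\digit` behaves like `[0-9]` and `\=` is a literal equals sign, and it reuses the query strings posted above. The unencoded sample `ExpandSection=11,33` and the helper name `excluded` are made up for illustration.

```python
import re

# Python approximations of the two REX expressions (assumption, not REX itself).
ORIGINAL = re.compile(r"ExpandSection=[0-9]{1,2},")    # literal comma only
FIXED = re.compile(r"ExpandSection=[0-9]{1,2}[,%]")    # comma or start of %2C

samples = [
    "ExpandSection=11%2C33",                  # from the thread
    "ExpandSection=27%2C37%2C1%2C13%2C18",    # from the thread
    "ExpandSection=11,33",                    # hypothetical unencoded form
]

def excluded(pattern, url):
    """Return True if the exclusion pattern matches anywhere in the URL."""
    return pattern.search(url) is not None

for url in samples:
    print(url, "original:", excluded(ORIGINAL, url), "fixed:", excluded(FIXED, url))
```

In this approximation the original pattern only catches the unencoded form, while the version with `[,%]` catches all three.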
jgdoke
Posts: 167
Joined: Wed Jul 14, 2004 10:52 am

Post by jgdoke »

That worked great, but now there is this new format:

ExpandSection=-5%2C-8%2C-6%2C-9%2C-11

May I ask how to exclude these also?
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

Post by mark »

I'd need more examples to make a general expression. But to include - in what you have, use
ExpandSection\==[\digit\-]{1,2}[,%]

Do you want to crawl any "ExpandSection" URLs at all? If not, just use
ExpandSection\=
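
A quick way to compare the two options is the Python sketch below. Again this only approximates REX (assuming `[\digit\-]` behaves like `[0-9-]`). The second sample, with a first value of `-11`, is hypothetical and is there to show why more examples would be needed for a general expression.

```python
import re

# Python approximations of the two suggestions above (assumption, not REX itself).
WITH_HYPHEN = re.compile(r"ExpandSection=[0-9-]{1,2}[,%]")
BROAD = re.compile(r"ExpandSection=")

samples = [
    "ExpandSection=-5%2C-8%2C-6%2C-9%2C-11",  # from the thread
    "ExpandSection=-11%2C-5",                  # hypothetical: first value is 3 characters
]

for url in samples:
    narrow = WITH_HYPHEN.search(url) is not None
    broad = BROAD.search(url) is not None
    print(url, "with hyphen:", narrow, "broad:", broad)
```

In this approximation the `{1,2}` limit means a first value such as `-11` slips through the narrower pattern, while the bare `ExpandSection=` pattern excludes every such URL.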