Parts of page not indexed
Posted: Mon May 18, 2009 3:29 pm
by kiddywood
Webinator is indexing specified pages fine except this type of page:
https://aonuk.taleo.net/careersection/2 ... lang=en_GB
The job titles listed at the bottom of the page are not being indexed. Is there any way to get these included as well?
Cheers
Posted: Mon May 18, 2009 4:46 pm
by John
It looks as if those titles are generated by a JavaScript application running in the page, and the search form uses AJAX-like features to update the page in place. That can keep them from being indexed, as the page has no single, fixed content.
Posted: Mon May 18, 2009 4:56 pm
by Kai
The job titles at the bottom are being produced by JavaScript. Only limited JavaScript is supported by Webinator, due to the dynamic nature of scripting and limits in the implementation.
Apparently some Webinator-unsupported functions/properties are being called on that page, for example the `firstChild' property, and the `getElementsByName' function. Thus Webinator cannot get the text of that JavaScript-produced section.
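To illustrate the general effect, here is a minimal sketch in Python (not Webinator or Vortex code, and not how Webinator works internally) of what a crawler sees when the scripts on a page do not run. The page markup, the `static_text` helper, and the job title are all made up for the example.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text from static HTML, ignoring <script> bodies."""

    def __init__(self):
        super().__init__()
        self.in_script = False
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self.in_script = True

    def handle_endtag(self, tag):
        if tag == "script":
            self.in_script = False

    def handle_data(self, data):
        # Script source is never executed here, so it is simply skipped.
        if not self.in_script and data.strip():
            self.chunks.append(data.strip())


def static_text(html):
    """Return the text a non-scripting crawler could index (hypothetical helper)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.chunks


# Made-up page: the job title only exists once the script has run.
PAGE = """<html><body>
<h1>Job search</h1>
<div id="jobs"></div>
<script>
document.getElementById("jobs").textContent = "Senior Analyst";
</script>
</body></html>"""

print(static_text(PAGE))
```

The heading is found, but the job title is not, because it only appears in the page after the script has executed.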
Posted: Tue Dec 11, 2012 3:14 pm
by kiddywood
Hi,
Apologies for dragging up this thread after so long but I'm still trying to find a solution to this problem.
Will it be possible for Webinator to have this functionality in the future, or are there any workarounds so that the JavaScript-produced content is indexed?
Thanks
Posted: Tue Dec 11, 2012 3:53 pm
by Kai
It's unlikely to work in the near term, if ever, as the JavaScript and DOM support in Webinator is too limited. The only other workaround I can think of is to find a URL that has static links, or to add the links manually or programmatically (for example, if you find they follow a standard format, you might be able to add some Vortex to derive the links from a given URL).
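As a rough sketch of the "derive the links programmatically" idea, here it is in Python rather than Vortex. It assumes the job pages share one URL pattern that differs only in a job ID; the base URL, the `job` and `lang` parameter names, and the IDs below are hypothetical placeholders, not the real format of the site in question.

```python
from urllib.parse import urlencode


def build_job_urls(base_url, job_ids, lang="en_GB"):
    """Build one URL per job ID from a fixed pattern (hypothetical format)."""
    return [
        f"{base_url}?{urlencode({'job': job_id, 'lang': lang})}"
        for job_id in job_ids
    ]


# Placeholder values: substitute the real pattern once it is known.
urls = build_job_urls("https://jobs.example.com/detail", [1001, 1002])

# One URL per line, ready to be handed to the crawler as extra pages to fetch.
print("\n".join(urls))
```

The resulting list would then need to be given to the crawler by whatever means it supports, which is not shown here.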