No documents match the query ...

ferruccio_manclossi
Posts: 9
Joined: Mon Dec 03, 2001 5:53 am

No documents match the query ...

Post by ferruccio_manclossi »

HI, novice question again.
I installed webinator starting as root user and selecting nobody as webinator user.
Webinator files and directory are all available for nobody user.

I indexed my site creating a defualt profile but every search I tried return always "No documents match the query." message ...

Any idea?

This is my profilewalk log (last rows)
===========================
Current run: Refresh STOP walk (auto refresh in seconds) Help

Webinator Walk Report for default

Creating database /Search/Webinator/default/db2...Done.
Walk started at 2001-12-05 17:39:29 (by user)
Start fetching at http://jplaza.italy.ibm.com/
Ignore urls containing any of the following:
/cgi-bin/
~
?

started 1 (17032) on http://jplaza.italy.ibm.com/
1900 pages fetched (40,854,176 bytes) from http://jplaza.italy.ibm.com/
Dispatcher stopping by request.

1900 pages (40,854,176 bytes) so far. 186 errors so far. 481 duplicate pages so far. 1900 http://jplaza.italy.ibm.com/Presentatio ... de0029.htm (5,870 bytes) 1899 http://jplaza.italy.ibm.com/Presentatio ... de0028.htm (5,411 bytes) 1898
....

Thanks in advance
Ferruccio
User avatar
Kai
Site Admin
Posts: 1271
Joined: Tue Apr 25, 2000 1:27 pm

No documents match the query ...

Post by Kai »

Check the HTML source of the results page for errors in comments; are there any? You might see something like "Query would require linear search".

You apparently stopped the walk before it completed ("Dispatcher stopping by request"), and search indexes were not built on the walk data. Therefore a search would fail with an error like the above. Let the walk complete and build indexes.
ferruccio_manclossi
Posts: 9
Joined: Mon Dec 03, 2001 5:53 am

No documents match the query ...

Post by ferruccio_manclossi »

after 3 days initial walk is still running but appears hanged after 1855 pages reached in the first day...

How can I verify what it's doing?
Search is not yet active...
===== walk log ========
Walk Status
Current Profile: default
Webinator 4.0
Current run: Refresh STOP walk (auto refresh in seconds) Help

Webinator Walk Report for default

Creating database /Search/Webinator/default/db2...Done.
Walk started at 2001-12-06 17:12:51 (by user)
Start fetching at http://jplaza.italy.ibm.com/
Ignore urls containing any of the following:
/cgi-bin/
~
?

started 1 (18508) on http://jplaza.italy.ibm.com/
1855 pages fetched (36,031,703 bytes) from http://jplaza.italy.ibm.com/

1855 pages (36,031,703 bytes) so far. 163 errors so far. 484 duplicate pages so far. 1855 http://jplaza.italy.ibm.com/Presentatio ... de0029.htm (5,870 bytes) 1854
====== ps -eaf | grep -E "tex|monitor" ===========
nobody 1998 1 0 Dec 06 - 7:05 monitor -d /Search/Webinator/default/db2/ -z
nobody 5522 1 0 Dec 05 - 8:41 monitor -d /usr/local/morph3/texis/testdb/ -z
nobody 15312 1 0 Dec 05 - 5:57 monitor r
nobody 15770 1 5 Dec 05 - 215:57 ./texis profile=default /usr/lpp/internet/server_root/pub/webinator/dowalk/dispatch.txt
nobody 18288 1 1 Dec 06 - 79:59 ./texis profile=default /usr/lpp/internet/server_root/pub/webinator/dowalk/dispatch.txt
User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

No documents match the query ...

Post by mark »

hmmm...
It should also list the last several Urls fetched. What were they?
What settings have you changed from default?

Check the /Search/Webinator/default/db2 directory to see if anything's being updated. If not, hit STOP. Try again with "Threads" set to 1.
ferruccio_manclossi
Posts: 9
Joined: Mon Dec 03, 2001 5:53 am

No documents match the query ...

Post by ferruccio_manclossi »

Stop and rerun with thread=1 and Server=1
Indexing seems hanged after 4 hours ...

Any idea? Is it just a *long* process?

thanks in advance
Ferruccio
=== Walk Log ====
Free Webinator
Upgrade to Commercial Webinator [Thunderstone]
Walk Status
Current Profile: default
Webinator 4.0
Current run: Refresh STOP walk (auto refresh in seconds) Help

Webinator Walk Report for default

Creating database /Search/Webinator/default/db2...Done.
Walk started at 2001-12-10 12:58:47 (by user)
Start fetching at http://jplaza.italy.ibm.com/
Ignore urls containing any of the following:
/cgi-bin/
~
?

started 1 (12922) on http://jplaza.italy.ibm.com/
2613 pages fetched (29,567,700 bytes) from http://jplaza.italy.ibm.com/

2613 pages (29,567,700 bytes) so far. 269 errors so far. 413 duplicate pages so far. 2613 http://jplaza.italy.ibm.com/Presentatio ... de0029.htm (5,870 bytes) 2612 http://jplaza.italy.ibm.com/Presentatio ... de0028.htm (5,411 bytes) 2611 http://jplaza.italy.ibm.com/Presentatio ... de0027.htm (5,633 bytes) 2610 http://jplaza.italy.ibm.com/Presentatio ... de0026.htm (4,792 bytes) 2609 http://jplaza.italy.ibm.com/Presentatio ... de0025.htm (6,361 bytes) 2608 http://jplaza.italy.ibm.com/Presentatio ... de0024.htm (5,198 bytes) 2607 http://jplaza.italy.ibm.com/Presentatio ... de0023.htm (6,330 bytes) 2606 http://jplaza.italy.ibm.com/Presentatio ... de0022.htm (6,162 bytes) 2605 http://jplaza.italy.ibm.com/Presentatio ... de0021.htm (6,658 bytes) 2604 http://jplaza.italy.ibm.com/Presentatio ... de0020.htm (6,237 bytes) 2603 http://jplaza.italy.ibm.com/Presentatio ... de0018.htm (2,254 bytes) 2602 http://jplaza.italy.ibm.com/Presentatio ... de0019.htm (4,435 bytes) 2601 http://jplaza.italy.ibm.com/Presentatio ... de0016.htm (3,773 bytes) 2600 http://jplaza.italy.ibm.com/Presentatio ... de0015.htm (6,599 bytes) 2599 http://jplaza.italy.ibm.com/Presentatio ... de0017.htm (4,995 bytes) 2598 http://jplaza.italy.ibm.com/Presentatio ... de0014.htm (3,889 bytes) 2597 http://jplaza.italy.ibm.com/Presentatio ... de0013.htm (6,303 bytes) 2596 http://jplaza.italy.ibm.com/Presentatio ... de0012.htm (5,428 bytes) 2595 http://jplaza.italy.ibm.com/Presentatio ... de0011.htm (5,923 bytes) 2594 http://jplaza.italy.ibm.com/Presentatio ... de0010.htm (2,649 bytes) 2593 http://jplaza.italy.ibm.com/Presentatio ... de0009.htm (3,955 bytes) 2592 http://jplaza.italy.ibm.com/Presentatio ... de0008.htm (4,680 bytes) 2591 http://jplaza.italy.ibm.com/Presentatio ... de0007.htm (4,335 bytes) 2590 http://jplaza.italy.ibm.com/Projects/HA ... MFW3b.html (32,558 bytes) 2589 http://jplaza.italy.ibm.com/Presentatio ... de0006.htm (9,236 bytes) 2588 http://jplaza.italy.ibm.com/Software/PC ... earch.html (19,455 bytes) 2587 http://jplaza.italy.ibm.com/Software/PC ... in_use.htm (50,013 bytes) 2586 http://jplaza.italy.ibm.com/Software/PC ... l-esc.html (15,686 bytes) 2585 http://jplaza.italy.ibm.com/Software/PC ... Terminals/ (6,416 bytes) 2584 http://jplaza.italy.ibm.com/Software/PC ... ts/FTPclt/ (6,365 bytes) 2583 http://jplaza.italy.ibm.com/Projects/HA ... srela.html (43,657 bytes) 2582 http://jplaza.italy.ibm.com/Projects/HA ... study.html (104,198 bytes) 2581 http://jplaza.italy.ibm.com/Projects/HA ... ntrol.html (14,967 bytes) 2580 http://jplaza.italy.ibm.com/Presentatio ... de0005.htm (6,229 bytes) 2579 http://jplaza.italy.ibm.com/Software/to ... /jpmap.txt (312 bytes) 2578 http://jplaza.italy.ibm.com/Software/to ... jpmap.html (5,833 bytes) 2577 http://jplaza.italy.ibm.com/Software/to ... moCSS.html (5,646 bytes) 2576 http://jplaza.italy.ibm.com/Software/to ... Plaza.html (724 bytes) 2575 http://jplaza.italy.ibm.com/Software/UN ... 33/estrai/ (105,798 bytes) 2574 http://jplaza.italy.ibm.com/Software/Ti ... min.sh.txt (2,865 bytes) 2573 http://jplaza.italy.ibm.com/Software/Ti ... v_nv6k.txt (16,954 bytes) 2572 http://jplaza.italy.ibm.com/Software/Ti ... llPCAG.txt (2,497 bytes) 2571 http://jplaza.italy.ibm.com/Software/Ti ... rp_sup.txt (724 bytes) 2570 http://jplaza.italy.ibm.com/Software/Ti ... /fixdb3.2/ (6,180 bytes) 2569 http://jplaza.italy.ibm.com/Software/Ti ... oup.pl.txt (1,302 bytes) 2568 http://jplaza.italy.ibm.com/Software/Ti ... status.txt (820 bytes) 2567 http://jplaza.italy.ibm.com/Software/Ti ... hgname.txt (3,583 bytes) 2566 http://jplaza.italy.ibm.com/Software/Ti ... deDown.NT/ (6,452 bytes) 2565 http://jplaza.italy.ibm.com/Software/Ti ... olMeth.txt (559 bytes) 2564 http://jplaza.italy.ibm.com/Software/Ti ... y/scripts/ (6,878 bytes) 2563 http://jplaza.italy.ibm.com/Software/Ti ... y/Lab_Rim/ (6,569 bytes) 2562 http://jplaza.italy.ibm.com/Software/PC ... table.html (3,982 bytes) 2561 http://jplaza.italy.ibm.com/Software/PC ... nu-faq.txt (130,226 bytes) 2560 http://jplaza.italy.ibm.com/Software/PC ... zip360.txt (46 bytes) 2559 http://jplaza.italy.ibm.com/Software/PC ... f/Mappers/ (6,797 bytes) 2558 http://jplaza.italy.ibm.com/Software/PC ... f/Editors/ (6,673 bytes) 2557 http://jplaza.italy.ibm.com/Software/PC ... t/rskit40/ (6,398 bytes) 2556 http://jplaza.italy.ibm.com/Software/PC ... gmt/Utils/ (6,687 bytes) 2555 http://jplaza.italy.ibm.com/Software/PC ... t/Monitor/ (7,620 bytes) 2554 http://jplaza.italy.ibm.com/Software/PC ... k/Servers/ (6,416 bytes) 2553 http://jplaza.italy.ibm.com/Software/PC ... MiscUtils/ (6,780 bytes) 2552 http://jplaza.italy.ibm.com/Software/PC ... k/Clients/ (6,664 bytes) 2551 http://jplaza.italy.ibm.com/Projects/Zu ... _1998.html (14,584 bytes) 2550 http://jplaza.italy.ibm.com/Projects/HA ... ticon.html (408 bytes) 2549 http://jplaza.italy.ibm.com/Projects/HA ... index.html (1,178 bytes) 2548 http://jplaza.italy.ibm.com/Projects/Enidata_RC36.html (17,528 bytes) 2547 http://jplaza.italy.ibm.com/Projects/Ca ... ona99.html (20,775 bytes) 2546 http://jplaza.italy.ibm.com/Presentatio ... de0004.htm (6,404 bytes) 2545 http://jplaza.italy.ibm.com/cisco/prese ... merce1.htm (6,930 bytes) 2544 http://jplaza.italy.ibm.com/Tips/tapehelp.html (253,139 bytes) 2543 http://jplaza.italy.ibm.com/Tips/mworld_prj.html (66,343 bytes) 2542 http://jplaza.italy.ibm.com/Tips/files/info2.html (5,431 bytes) 2541 http://jplaza.italy.ibm.com/Tips/files/info1.html (13,351 bytes) 2540 http://jplaza.italy.ibm.com/Tips/datasheet.html (15,198 bytes) 2539 http://jplaza.italy.ibm.com/Tips/auto_inv32.html (33,844 bytes) 2538 http://jplaza.italy.ibm.com/Tips/archit ... arket.html (296,639 bytes) 2537 http://jplaza.italy.ibm.com/Tips/Tips8.html (6,662 bytes) 2536 http://jplaza.italy.ibm.com/Tips/Tips735.html (14,327 bytes) 2535 http://jplaza.italy.ibm.com/Tips/Tips730.html (6,185 bytes) 2534 http://jplaza.italy.ibm.com/Tips/Tips659.html (12,491 bytes) 2533 http://jplaza.italy.ibm.com/Tips/Tips656.html (2,975 bytes) 2532 http://jplaza.italy.ibm.com/Tips/Tips653.html (12,214 bytes) 2531 http://jplaza.italy.ibm.com/Tips/Tips38.html (3,199 bytes) 2530 http://jplaza.italy.ibm.com/Tips/Tips329.html (3,230 bytes) 2529 http://jplaza.italy.ibm.com/Tips/Tips323.html (5,420 bytes) 2528 http://jplaza.italy.ibm.com/Tips/Tips322.html (3,566 bytes) 2527 http://jplaza.italy.ibm.com/Tips/Tips316.html (5,327 bytes) 2526 http://jplaza.italy.ibm.com/Tips/Tips315.html (4,850 bytes) 2525 http://jplaza.italy.ibm.com/Tips/Tips314.html (4,850 bytes) 2524 http://jplaza.italy.ibm.com/Tips/Tips312.html (3,463 bytes) 2523 http://jplaza.italy.ibm.com/Tips/Tips311.html (4,890 bytes) 2522 http://jplaza.italy.ibm.com/Tips/Tips310.html (5,388 bytes) 2521 http://jplaza.italy.ibm.com/Tips/Tips309.html (5,387 bytes) 2520 http://jplaza.italy.ibm.com/Tips/Tips296.html (3,786 bytes) 2519 http://jplaza.italy.ibm.com/Tips/Tips237.html (5,116 bytes) 2518 http://jplaza.italy.ibm.com/Tips/Tips180.html (13,945 bytes) 2517 http://jplaza.italy.ibm.com/Tips/Tips123.html (31,390 bytes) 2516 http://jplaza.italy.ibm.com/Software/tools/MyMapper/ (6,649 bytes) 2515 http://jplaza.italy.ibm.com/Software/fixdist/fddb/ (15,688 bytes) 2514 http://jplaza.italy.ibm.com/Software/UNIX/led.txt (96,413 bytes)

==== ps -eaf | grep -E " tex|monitor" ====
nobody 5532 1 0 12:58:42 - 0:13 monitor -d /Search/Webinator/default/db2/ -z
nobody 15312 1 0 Dec 05 - 6:14 monitor r
nobody 18292 1 0 12:55:56 - 0:13 monitor -d /usr/local/morph3/texis/testdb/ -z
nobody 18530 1 1 12:58:38 - 1:22 ./texis profile=default /usr/lpp/internet/server_root/pub/webinator/dowalk/dispatch.txt
====
Webinator user=nobody
=== last modified files into /Search/Webinator ====
db1 directory:
-rw-rw-rw- 1 nobody nobody 125664 Dec 10 15:53 SYSLOCKS
-rw-rw-rw- 1 nobody nobody 8 Dec 10 15:53 SYSLOCKS.SEQ
-rw------- 1 nobody nobody 1328 Dec 10 15:49 SYSSTATS.tbl
-rw------- 1 nobody nobody 27176 Dec 06 17:11 SYSCOLUMNS.tbl
-rw------- 1 nobody nobody 5790 Dec 06 17:11 SYSINDEX.tbl

db2 directory:
-rw-rw-rw- 1 nobody usr 125664 Dec 10 15:51 SYSLOCKS
-rw-rw-rw- 1 nobody usr 8 Dec 10 15:51 SYSLOCKS.SEQ
-rw------- 1 nobody nobody 1328 Dec 10 14:58 SYSSTATS.tbl
-rw------- 1 nobody nobody 3336 Dec 10 13:40 counts.tbl
-rw------- 1 nobody nobody 11869018 Dec 10 13:40 html.tbl
User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

No documents match the query ...

Post by mark »

Any single page fetch should not take very long unless the page is huge or the webserver or connection is slow. Besides, the timeout would cut off a long downloading page. I can't try what you're doing since that server is, apparently, not on the net. I assume the webserver is still responsive to a web browser when this occurs? And that new urls stop appearing in the list the the file modification times in the database stop changing. You didn't mention what other settings you may have changed. Shall I assume "none"?

The webserver doesn't have some self protection mechanism that blocks users that fetch too many pages too quickly does it?
ferruccio_manclossi
Posts: 9
Joined: Mon Dec 03, 2001 5:53 am

No documents match the query ...

Post by ferruccio_manclossi »

I deleted all my previous setting and I tried a new base scan with the following parameters.
The scan, after reached 10 pages seems hangs and not terminated (I cannot search again...)

My Web has no self-protection mechanism (utility like webzip run fines...).
It's an intranet server (less than 5.000 pages) ...

Any suggestions?
Ferruccio
Basic Walk Settings:
================================
Basic Walk Settings
Current Profile: base
Webinator 4.0
Database ? /Search/Webinator/base/db1
Walk Summary ? New walk started: 2001-12-11 12:13:47 (by user)
Base URL ? http://jplaza.italy.ibm.com/
Enterprise ?Yes - not flagged Domain italy.ibm.com
Robots ?robots.txt: Y Meta: Y
Extensions ? .html .htm .txt
Exclusions ? /cgi-bin/ ~ ?
Crawl Delay ? 0
Parallelism ?Threads:1 Servers: 2
Verbosity ? 2
Rewalk Type ?NEW
Rewalk Schedule ? Frequency NONE Hour NONE
Watch URL ? -empty-
Notify ? -empty-

==== All Walk Setting ====
all defaults except MaxPages=10

===== Walk Status after 1 hours =====
Walk Status
Current Profile: base
Webinator 4.0
Current run: Refresh STOP walk (auto refresh in seconds) Help

Webinator Walk Report for base

Creating database /Search/Webinator/base/db2...Done.
Walk started at 2001-12-11 12:13:47 (by user)
Start fetching at http://jplaza.italy.ibm.com/
Ignore urls containing any of the following:
/cgi-bin/
~
?

started 1 (19040) on http://jplaza.italy.ibm.com/
Maxpages of 10 reached.
10 pages fetched (176,891 bytes) from http://jplaza.italy.ibm.com/

10 pages (176,891 bytes) so far. 0 errors so far. 0 duplicate pages so far. 10 http://jplaza.italy.ibm.com/biblio/ (2,330 bytes) 9 http://jplaza.italy.ibm.com/TivCorner.html (9,318 bytes) 8 http://jplaza.italy.ibm.com/Tips/ (11,938 bytes) 7 http://jplaza.italy.ibm.com/Send.html (7,150 bytes) 6 http://jplaza.italy.ibm.com/Resource.html (7,882 bytes) 5 http://jplaza.italy.ibm.com/Presentation.html (10,235 bytes) 4 http://jplaza.italy.ibm.com/News.html (19,773 bytes) 3 http://jplaza.italy.ibm.com/Link.html (93,771 bytes) 2 http://jplaza.italy.ibm.com/Download.html (7,179 bytes) 1 http://jplaza.italy.ibm.com/ (7,315 bytes)

==== ps -eaf | grep -E "tex|moni" =======
nobody 5544 1 1 12:13:41 - 0:07 ./texis profile=base /usr/lpp/internet/server_root/pub/webinator/dowalk/dispatch.txt
nobody 7660 1 0 12:13:44 - 0:00 monitor -d /Search/Webinator/base/db2/ -z
nobody 12944 1 0 12:07:39 - 0:00 monitor -d /usr/local/morph3/texis/testdb/ -z
nobody 18302 1 0 12:07:39 - 0:00 monit
nobody 20012 1 1 12:11:39 - 0:00 monitor -d /Search/Webinator/base/db1/ -z

======= ls -lt /Search/Webinator/base ======
total 6
drwxrwxrwx 2 nobody nobody 1024 Dec 11 12:25 db1
-rw-rw-rw- 1 nobody nobody 385 Dec 11 12:14 db2.long
-rw-rw-rw- 1 nobody nobody 59 Dec 11 12:13 summary
drwxrwxrwx 2 nobody nobody 1024 Dec 11 12:13 db2

============ls -lt db1 ===========
total 69
-rw-rw-rw- 1 nobody nobody 125664 Dec 11 12:28 SYSLOCKS
-rw-rw-rw- 1 nobody nobody 8 Dec 11 12:28 SYSLOCKS.SEQ
-rw------- 1 nobody nobody 1328 Dec 11 12:25 SYSSTATS.tbl
-rw------- 1 nobody nobody 27176 Dec 11 12:11 SYSCOLUMNS.tbl
-rw------- 1 nobody nobody 5782 Dec 11 12:11 SYSINDEX.tbl
-rw------- 1 nobody nobody 4714 Dec 11 12:11 SYSPERMS.tbl
-rw------- 1 nobody nobody 10556 Dec 11 12:11 SYSTABLES.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 categories.tbl
-rw------- 1 nobody nobody 3336 Dec 11 12:11 counts.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 error.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 options.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 querylog.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 todo.tbl
-rw------- 1 nobody nobody 152 Dec 11 12:11 xcatno.btr
-rw------- 1 nobody nobody 152 Dec 11 12:11 xerrorurl.btr
-rw------- 1 nobody nobody 152 Dec 11 12:11 xoptname.btr
-rw------- 1 nobody nobody 152 Dec 11 12:11 xoptstr.btr
-rw------- 1 nobody nobody 152 Dec 11 12:11 xqueryid.btr
-rw------- 1 nobody nobody 152 Dec 11 12:11 xtodourl.btr
-rw------- 1 nobody nobody 3314 Dec 11 12:11 SYSMETAINDEX.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 SYSTRIG.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 html.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 refs.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:11 vortex.tbl
-rw------- 1 nobody nobody 152 Dec 11 12:11 xhtmlhash.btr
-rw------- 1 nobody nobody 152 Dec 11 12:11 xhtmlid.btr
-rw------- 1 nobody nobody 152 Dec 11 12:11 xhtmlurl.btr
-rw------- 1 nobody nobody 152 Dec 11 12:11 xvid.btr
-rw------- 1 nobody nobody 3438 Dec 11 12:11 SYSUSERS.tbl

============ls -lt db2 ===========
total 208
-rw-rw-rw- 1 nobody nobody 125664 Dec 11 12:20 SYSLOCKS
-rw-rw-rw- 1 nobody nobody 8 Dec 11 12:20 SYSLOCKS.SEQ
-rw------- 1 nobody nobody 3336 Dec 11 12:14 counts.tbl
-rw------- 1 nobody nobody 39778 Dec 11 12:14 html.tbl
-rw------- 1 nobody nobody 64910 Dec 11 12:14 refs.tbl
-rw------- 1 nobody nobody 8350 Dec 11 12:14 xhtmlhash.btr
-rw------- 1 nobody nobody 8350 Dec 11 12:14 xhtmlid.btr
-rw------- 1 nobody nobody 8350 Dec 11 12:14 xhtmlurl.btr
-rw------- 1 nobody nobody 12952 Dec 11 12:13 options.tbl
-rw------- 1 nobody nobody 4398 Dec 11 12:13 todo.tbl
-rw------- 1 nobody nobody 8350 Dec 11 12:13 xoptname.btr
-rw------- 1 nobody nobody 8350 Dec 11 12:13 xoptstr.btr
-rw------- 1 nobody nobody 9384 Dec 11 12:13 xtodourl.btr
-rw------- 1 nobody nobody 1328 Dec 11 12:13 SYSSTATS.tbl
-rw------- 1 nobody nobody 27176 Dec 11 12:13 SYSCOLUMNS.tbl
-rw------- 1 nobody nobody 5782 Dec 11 12:13 SYSINDEX.tbl
-rw------- 1 nobody nobody 4714 Dec 11 12:13 SYSPERMS.tbl
-rw------- 1 nobody nobody 10556 Dec 11 12:13 SYSTABLES.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:13 categories.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:13 querylog.tbl
-rw------- 1 nobody nobody 152 Dec 11 12:13 xcatno.btr
-rw------- 1 nobody nobody 152 Dec 11 12:13 xqueryid.btr
-rw------- 1 nobody nobody 3314 Dec 11 12:13 error.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:13 vortex.tbl
-rw------- 1 nobody nobody 152 Dec 11 12:13 xerrorurl.btr
-rw------- 1 nobody nobody 152 Dec 11 12:13 xvid.btr
-rw------- 1 nobody nobody 3314 Dec 11 12:13 SYSMETAINDEX.tbl
-rw------- 1 nobody nobody 3314 Dec 11 12:13 SYSTRIG.tbl
-rw------- 1 nobody nobody 3438 Dec 11 12:13 SYSUSERS.tbl
User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

No documents match the query ...

Post by mark »

It would appear that the walk is completed, but the dispatcher is not detecting it properly and making the database live for searching. We're not able to replicate the problem here, but we have an idea what may be causing the problem. We'll try to have an update to the dowalk script that you can download later today.
User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

No documents match the query ...

Post by mark »

Ok, a little later than expected, but there's a new dowalk script you can download from http://www.thunderstone.com/texis/site/ ... ample.html

It does more checking on child processes so it's not likely to be fooled by the OS reusing process id's rapidly. Hopefully this will solve the problem you're encountering. It may also solve the problems some Windows users have reported with walks getting stuck.
ferruccio_manclossi
Posts: 9
Joined: Mon Dec 03, 2001 5:53 am

No documents match the query ...

Post by ferruccio_manclossi »

Thanks guys...
1-downloaded new dowalk script
2-substituted old dowalk with new script to /usr/local/morph3/webinator directory
3-New script owned by my webinator user (nobody)
4-killed all active process (texis, monit, monitor, ...)
5-deleted old profiles and databases
6-Created a new Prova profile with maxpage=10
7-Launched Prova profile
8-Seems hanged after reaching 10 pages
9-Found a <defunct> process child of dispatch action ... ??? Very strange!!!
======= ps -eaf | grep -E "tex|defunct|moni" =====
root 2034 15184 4 11:11:48 pts/0 0:00 grep -E tex|moni|defun
nobody 7532 1 1 10:46:44 - 0:26 ./texis profile=prova /usr/lpp/internet/server_root/pub/webinator/dowalk/dispatch.txt
nobody 12954 7532 120 0:00 <defunct>
nobody 18306 1 0 10:45:13 - 0:00 monit
nobody 19048 1 0 10:46:47 - 0:01 monitor -d /Search/Webinator/prova/db2/ -z
nobody 19782 1 0 10:45:13 - 0:01 monitor -d /usr/local/morph3/texis/testdb/ -z

====== Walk Status =========
Walk Status
Current Profile: prova
Webinator 4.0
Current run: Refresh STOP walk (auto refresh in seconds) Help

Webinator Walk Report for prova

Creating database /Search/Webinator/prova/db2...Done.
Walk started at 2001-12-13 10:46:50 (by user)
Start fetching at http://jplaza.italy.ibm.com/
Ignore urls containing any of the following:
/cgi-bin/
~
?

started 1 (12954) on http://jplaza.italy.ibm.com/
Maxpages of 10 reached.
10 pages fetched (176,891 bytes) from http://jplaza.italy.ibm.com/

10 pages (176,891 bytes) so far. 0 errors so far. 0 duplicate pages so far. 10 http://jplaza.italy.ibm.com/biblio/ (2,330 bytes) 9 http://jplaza.italy.ibm.com/TivCorner.html (9,318 bytes) 8 http://jplaza.italy.ibm.com/Tips/ (11,938 bytes) 7 http://jplaza.italy.ibm.com/Send.html (7,150 bytes) 6 http://jplaza.italy.ibm.com/Resource.html (7,882 bytes) 5 http://jplaza.italy.ibm.com/Presentation.html (10,235 bytes) 4 http://jplaza.italy.ibm.com/News.html (19,773 bytes) 3 http://jplaza.italy.ibm.com/Link.html (93,771 bytes) 2 http://jplaza.italy.ibm.com/Download.html (7,179 bytes) 1 http://jplaza.italy.ibm.com/ (7,315 bytes)

Have I missed something important?
Any suggestions?
Ferruccio
Post Reply