On Jun 12, 2009, at 1:39 PM, Dan Wells wrote:

Hello Victoria,

Though I think it is designed to test a full Evergreen install, running the settings-tester.pl script may help identify problems with just OpenSRF as well. To quote the wiki:

"As the opensrf user, run the settings-tester.pl script to see if it finds any system configuration problems. The script is found at Open- ILS/src/support-scripts/settings-tester.pl in the Evergreen source tree."

Feel free to send the output back to this list if it doesn't mean much to you.

Dan, thanks so much for your reply. I think there might be something more going on than just a simple configuration error, but I defer to the greater wisdom of the list.

I completely wiped the system and started over, just to eliminate anything I might have screwed up. (Ah, the joys of a new machine.) So I'm running Ubuntu 8.04 (with all available updates installed) and I installed OpenSRF 1.0.6. I went meticulously through the installation instructions again. I tried to run the code in "testing connections to OpenSRF on the installation page again. Only this time I could request the math service and get "4" back on the private.localhost connection, but I could *not* get the resource when I would try to do it via the public.localhost connection.

Let me repeat to be clear: testing worked fine on private.localhost, but not on public.localhost.

I did run settings-tester.pl (assumed the trunk version was the latest and greatest, dated 3 months ago). While I got the errors I expected as Evergreen is not yet installed (lots of "please install <library>" statements), it did test the ejabberd stuff:

Checking Jabber connection for user opensrf, domain private.localhost
* Jabber successfully connected

Checking Jabber connection for user opensrf, domain public.localhost
* Jabber successfully connected

Checking Jabber connection for user router, domain public.localhost
* Jabber successfully connected

Checking Jabber connection for user router, domain private.localhost
* Jabber successfully connected




And it looks fine. I see lots of OpenSRF processes:

opensrf   3753     1  2 Jun11 ?        00:37:07 OpenSRF Router
opensrf   3754     1  2 Jun11 ?        00:38:21 OpenSRF Router
opensrf  14159     1  0 14:32 ?        00:00:00 OpenSRF Router
opensrf  14165     1  0 14:32 ?        00:00:00 OpenSRF Router
opensrf 14170 1 0 14:32 ? 00:00:00 OpenSRF controller [opensrf.settings] opensrf 14172 14170 0 14:32 ? 00:00:00 OpenSRF master [opensrf.settings] opensrf 14173 14170 0 14:32 ? 00:00:00 OpenSRF listener [opensrf.settings] opensrf 14174 14172 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.settings] opensrf 14175 14172 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.settings] opensrf 14176 14172 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.settings] opensrf 14177 14172 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.settings] opensrf 14178 14172 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.settings] opensrf 14179 1 0 14:32 ? 00:00:00 OpenSRF controller [opensrf.persist] opensrf 14181 14179 0 14:32 ? 00:00:00 OpenSRF master [opensrf.persist]
opensrf  14183     1  0 14:32 ?        00:00:00 OpenSRF System-C
opensrf 14184 14183 0 14:32 ? 00:00:00 OpenSRF Listener [opensrf.math] opensrf 14185 14184 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.math] opensrf 14188 14179 0 14:32 ? 00:00:00 OpenSRF listener [opensrf.persist] opensrf 14189 14184 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.math] opensrf 14190 14183 0 14:32 ? 00:00:00 OpenSRF Listener [opensrf.dbmath] opensrf 14191 14190 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.dbmath] opensrf 14192 14184 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.math] opensrf 14194 14190 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.dbmath] opensrf 14195 14184 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.math] opensrf 14197 14190 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.dbmath] opensrf 14198 14184 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.math] opensrf 14200 14190 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.dbmath] opensrf 14201 14190 0 14:32 ? 00:00:00 OpenSRF Drone [opensrf.dbmath] opensrf 14204 14181 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.persist] opensrf 14205 14181 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.persist] opensrf 14206 14181 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.persist] opensrf 14207 14181 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.persist] opensrf 14208 14181 0 14:32 ? 00:00:00 OpenSRF drone [opensrf.persist]
vxbush   14399  6589  0 15:27 pts/0    00:00:00 grep OpenSRF




And ejabberd says it's running:

$ sudo ejabberdctl status
Node ejabb...@localhost is started. Status: started
ejabberd is running


Upping the logging to level 4 for the log file /tmp/srfsh.log in my .srfsh.xml file and trying to connect via public.localhost again, I see this:


srfsh 2009-06-12 15:08:52 [INFO:14305:osrf_system.c:415:] Bootstrapping system with domain public.localhost, port 5222, and unixpath (none) srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_app_session.c:282:] opensrf.math session is stateless srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_app_session.c:293:] Building a new client session with id [opensrf.math] [1244837338.011557.124483733814305] srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_app_session.c:500:] AppSession connecting to [email protected]/opensrf.math srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_app_session.c:456:] App Session [opensrf.math] [1244837338.011557.124483733814305] resetting remote id to [email protected]/opensrf.math srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_app_session.c:639:] AppSession in queue_wait with timeout 0 srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_app_session.c:456:] App Session [opensrf.math] [1244837338.011557.124483733814305] resetting remote id to [email protected]/opensrf.math srfsh 2009-06-12 15:08:58 [INFO:14305:osrf_app_session.c:608:] [opensrf.math] sent 83 bytes of data to [email protected]/ opensrf.math srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_app_session.c:611:] Sent: [{"__c":"osrfMessage","__p":{"threadTrace":"0","locale":"en- US","type":"CONNECT"}}] srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_app_session.c:639:] AppSession in queue_wait with timeout 5 srfsh 2009-06-12 15:08:58 [INFO:14305:transport_session.c:436:] Received <error> message with type cancel and code 503 srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_stack.c:24:] Received message from transport code from [email protected]/opensrf.math srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_stack.c:51:] Transport handler received new message from [email protected]/ opensrf.math to [email protected]/ _evergreen_1244837332.075423_14305 with body

[{"__c":"osrfMessage","__p":{"threadTrace":"0","locale":"en- US","type":"CONNECT"}}]

srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_stack.c:84:] We received 1 messages from [email protected]/opensrf.math srfsh 2009-06-12 15:08:58 [WARN:14305:osrf_stack.c:95:] !!! Received Jabber layer error message srfsh 2009-06-12 15:08:58 [WARN:14305:osrf_stack.c:105:] * Jabber Error is for top level remote id [[email protected]/ opensrf.math], no one to send my message to! Cutting request short... srfsh 2009-06-12 15:08:58 [INFO:14305:osrf_stack.c:116:] Message processing duration 0.000164 srfsh 2009-06-12 15:08:58 [DEBG:14305:osrf_stack.c:119:] after msg delete srfsh 2009-06-12 15:08:58 [ERR :14305:osrf_app_session.c:516:] cannot communicate with opensrf.math srfsh 2009-06-12 15:08:58 [WARN:14305:srfsh.c:576:] Unable to connect to remote service opensrf.math

srfsh 2009-06-12 15:09:00 [DEBG:14305:socket_bundle.c:394:] removing socket 3



Now why would I get an error about the top level connection, when testing via settings_tester.pl showed that the connections were successfully made?


If that doesn't help, I would say your problem is probably with ejabberd and probably a very small mistake. Try running (as root):

ejabberdctl status

and see what that reports. If it says ejabberd is "not running" try steps 5-7 again from the page you mentioned, including the sub-step in #5, then do the math test again. If that doesn't work, move on to carefully double-check your work in steps 9-10. If you are wondering, I am pretty sure it doesn't hurt anything to run the 'register' commands a second time if you feel you may have missed one.

Good luck,
DW

I did some google spelunking, and discovered that someone else was having problems with code 503 errors in the IRC chat log at
http://www.open-ils.org/irc_logs/openils-evergreen/2009-03/%23openils-evergreen.16-Mon-2009.log

From looking over the responses and configuration files posted, the supposed solution was changing
        {access, max_user_sessions, [{10, all}]}.
to
        {access, max_user_sessions, [{1000, all}]}.
(and which is now shown as an option in the installation instructions). However, I don't have that line, and I do have
        {max_user_sessions, 1000}.
as requested by the installation instructions.

I'm open to any suggestions.


--
Victoria Bush
Opscan Evaluation Manager
Center for Teaching, Learning & Technology
[email protected]



Reply via email to