On Jan 12, 2007, at 8:09 AM, Scott Atchley wrote:
On Jan 11, 2007, at 5:43 PM, Sam Lang wrote:
In any case, Scott it might be worth it to try this patch and see
if we're able to post operations for expected messages before
they're received.
-sam
The server aborts with:
[D 08:08:18.429878] PVFS2 Server version
2.6.1pre1-2007-01-11-200834 starting.
pvfs2-server: src/server/io.sm:357: io_check_write_completion:
Assertion `s_op->u.io.flow_done || s_op->u.io.positive_ack_done'
failed.
Scott
The verbose output of the entire pvfs2-cp up to the abort:
[D 01/12 09:01] PINT_decode
[D 01/12 09:01] lebf_decode_req
[D 01/12 09:01] (0x81934a8) getconfig (prelude sm) state: req_sched
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x81934a8:
getconfig: request
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x81934a8:
getconfig: start
[D 01/12 09:01] (0x81934a8) getconfig (prelude sm) state:
getattr_if_needed
[D 01/12 09:01] PVFS operation "getconfig" got attr mask 127
(attr_uid_valid? yes, attr_owner = 0, credentials_uid = 8175)
(attr_gid_valid? yes, attr_group = 0, credentials.gid = 1001)
[D 01/12 09:01] Final permission check for "getconfig" set error code
to 0
[D 01/12 09:01] (0x81934a8) getconfig (prelude sm) state: perm_check
(status = 0)
[D 01/12 09:01] (0x81934a8) getconfig (FR sm) state: release:
(error_code = 0)
[D 01/12 09:01] (0x81934a8) getconfig (FR sm) state: send_resp
(status = 0)
[D 01/12 09:01] PINT_encode
[D 01/12 09:01] encode_common
[D 01/12 09:01] lebf_encode_resp
[D 01/12 09:01] BMI_post_send_list: addr: 178, count: 1, total_size:
880, tag: 1
[D 01/12 09:01] element 0: offset: 0xb58b4008, size: 880
[D 01/12 09:01] (0x81934a8) getconfig (FR sm) state: cleanup
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x81934a8:
getconfig: finish (Success)
[D 01/12 09:01] PINT_encode_release
[D 01/12 09:01] lebf_encode_rel
[D 01/12 09:01] PINT_decode_release
[D 01/12 09:01] lebf_decode_rel
[D 01/12 09:01] PINT_decode
[D 01/12 09:01] lebf_decode_req
[D 01/12 09:01] (0x81944f0) lookup_path (prelude sm) state: req_sched
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x81944f0:
lookup_path: request
[D 01/12 09:01] REQ SCHED POSTING, handle: 1048576, queue_element:
0xb3e02a98
[D 01/12 09:01] REQ SCHED SCHEDULING, handle: 1048576, queue_element:
0xb3e02a98
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x81944f0:
lookup_path: start
[D 01/12 09:01] (0x81944f0) lookup_path (prelude sm) state:
getattr_if_needed
[D 01/12 09:01] About to retrieve attributes for handle 1048576
[D 01/12 09:01] dspace_getattr fast path attr cache hit on 1048576
(dfile_count=0 | dist_size=0 | data_size=0)
[D 01/12 09:01] PVFS operation "lookup_path" got attr mask 127
(attr_uid_valid? yes, attr_owner = 8175, credentials_uid =
8175)
(attr_gid_valid? yes, attr_group = 1001, credentials.gid =
1001)
[D 01/12 09:01] Final permission check for "lookup_path" set error
code to 0
[D 01/12 09:01] (0x81944f0) lookup_path (prelude sm) state:
perm_check (status = 0)
[D 01/12 09:01] dspace_getattr fast path attr cache hit on 1048576
(dfile_count=0 | dist_size=0 | data_size=0)
[D 01/12 09:01] attrs = (owner = 8175, group = 1001, perms = 777,
type = 4)
[D 01/12 09:01] - check_mode called ---
(uid=8175,gid=1001,access_type=1)
[D 01/12 09:01] - object attributes --- (uid=8175,gid=1001,mode=511)
[D 01/12 09:01] - checking if uid (8175) is root ...
[D 01/12 09:01] - no
[D 01/12 09:01] - checking if owner (8175) matches uid (8175)...
[D 01/12 09:01] - yes
[D 01/12 09:01] - checking if permissions (511) allows access type
(1) for user...
[D 01/12 09:01] - yes
[D 01/12 09:01] object is a directory; will be looking for handle
for segment "test" in a bit
[D 01/12 09:01] reading dirent handle value from handle 1048576
[D 01/12 09:01] *** Trove KeyVal Read of de
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048576
returned (num_elems=1)
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] [DBPF THREAD]: [KEYVAL -1]: -13
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(KEYVAL_READ)
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048576
returned (num_elems=1)
[D 01/12 09:01] ** CANNOT cache data retrieved (key is de)
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(KEYVAL_READ) (ret: 1)
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] reading from dirent handle = 4, segment = test (len=4)
[D 01/12 09:01] *** Trove KeyVal Read of test
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] [DBPF THREAD]: [KEYVAL -1]: -14
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(KEYVAL_READ)
[D 01/12 09:01] warning: keyval read error on handle 4 and key= test
(DB_NOTFOUND: No matching key/data pair found)
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(KEYVAL_READ) (ret: -1073742082)
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] sending 'error' response with 0 handle(s) and 0 attr
(s)
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x81944f0:
lookup_path: path: test, lookup failed
[D 01/12 09:01] (0x81944f0) lookup_path (FR sm) state: release:
(error_code = -1073742082)
[D 01/12 09:01] REQ SCHED RELEASING, handle: 1048576, queue_element:
0xb3e02a98
[D 01/12 09:01] (0x81944f0) lookup_path (FR sm) state: send_resp
(status = -1073741826)
[D 01/12 09:01] PINT_encode
[D 01/12 09:01] encode_common
[D 01/12 09:01] lebf_encode_resp
[D 01/12 09:01] BMI_post_send_list: addr: 178, count: 1, total_size:
32, tag: 2
[D 01/12 09:01] element 0: offset: 0xb58b4008, size: 32
[D 01/12 09:01] (0x81944f0) lookup_path (FR sm) state: cleanup
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x81944f0:
lookup_path: finish (No such file or directory)
[D 01/12 09:01] PINT_encode_release
[D 01/12 09:01] lebf_encode_rel
[D 01/12 09:01] PINT_decode_release
[D 01/12 09:01] lebf_decode_rel
[D 01/12 09:01] PINT_decode
[D 01/12 09:01] lebf_decode_req
[D 01/12 09:01] (0x8195538) getattr (prelude sm) state: req_sched
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x8195538:
getattr: request
[D 01/12 09:01] REQ SCHED POSTING, handle: 1048576, queue_element:
0xb3e02ba0
[D 01/12 09:01] REQ SCHED SCHEDULING, handle: 1048576, queue_element:
0xb3e02ba0
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x8195538:
getattr: start
[D 01/12 09:01] (0x8195538) getattr (prelude sm) state:
getattr_if_needed
[D 01/12 09:01] About to retrieve attributes for handle 1048576
[D 01/12 09:01] dspace_getattr fast path attr cache hit on 1048576
(dfile_count=0 | dist_size=0 | data_size=0)
[D 01/12 09:01] PVFS operation "getattr" got attr mask 127
(attr_uid_valid? yes, attr_owner = 8175, credentials_uid =
8175)
(attr_gid_valid? yes, attr_group = 1001, credentials.gid =
1001)
[D 01/12 09:01] Final permission check for "getattr" set error code to 0
[D 01/12 09:01] (0x8195538) getattr (prelude sm) state: perm_check
(status = 0)
[D 01/12 09:01] No version found! Using mtime 1168610456
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x8195538:
getattr: type: directory
[D 01/12 09:01] getattr: dirent_count not needed.
[D 01/12 09:01] getattr: dfile_count needed.
[D 01/12 09:01] *** Trove KeyVal Read of de
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048576
returned (num_elems=1)
[D 01/12 09:01] op_queue add: 0xb3e09d70
[D 01/12 09:01] [DBPF THREAD]: [KEYVAL -1]: -15
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(KEYVAL_READ)
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048576
returned (num_elems=1)
[D 01/12 09:01] ** CANNOT cache data retrieved (key is de)
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(KEYVAL_READ) (ret: 1)
[D 01/12 09:01] op_queue add: 0xb3e09d70
[D 01/12 09:01] trying to getxattr of
user.pvfs2.dist_name,user.pvfs2.dist_params,user.pvfs2.num_dfiles of
dir handle (coll_id = 1318064247, handle = 1048576
[D 01/12 09:01] op_queue add: 0xb3e0c2d0
[D 01/12 09:01] [DBPF THREAD]: [KEYVAL -1]: -16
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(KEYVAL_READ_LIST)
[D 01/12 09:01] keyval get user.pvfs2.dist_name failed with error
DB_NOTFOUND: No matching key/data pair found
[D 01/12 09:01] keyval get user.pvfs2.dist_params failed with error
DB_NOTFOUND: No matching key/data pair found
[D 01/12 09:01] keyval get user.pvfs2.num_dfiles failed with error
DB_NOTFOUND: No matching key/data pair found
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(KEYVAL_READ_LIST) (ret: -1073742082)
[D 01/12 09:01] op_queue add: 0xb3e0c2d0
[D 01/12 09:01] getattr: job status code = -1073742082
[D 01/12 09:01] getattr: dir hint dfile_count: 0
[D 01/12 09:01] - RETURNING retrieved attrs: [owner = 8175, group = 1001
perms = 777, type = 4, atime = 1168610456, mtime = 1168610456
ctime = 1168610456, dist_size = 0]
[D 01/12 09:01] server returning dirent_count = 0 dfile_count = 0
dist_name_len = 0 dist_params_len = 0
[D 01/12 09:01] @ End DIRECTORY attributes: sending status 0 (error = 0)
[D 01/12 09:01] (0x8195538) getattr (FR sm) state: release:
(error_code = 0)
[D 01/12 09:01] REQ SCHED RELEASING, handle: 1048576, queue_element:
0xb3e02ba0
[D 01/12 09:01] (0x8195538) getattr (FR sm) state: send_resp (status
= 0)
[D 01/12 09:01] PINT_encode
[D 01/12 09:01] encode_common
[D 01/12 09:01] lebf_encode_resp
[D 01/12 09:01] BMI_post_send_list: addr: 178, count: 1, total_size:
108, tag: 3
[D 01/12 09:01] element 0: offset: 0xb58b4008, size: 108
[D 01/12 09:01] (0x8195538) getattr (FR sm) state: cleanup
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x8195538:
getattr: finish (Success)
[D 01/12 09:01] PINT_encode_release
[D 01/12 09:01] lebf_encode_rel
[D 01/12 09:01] PINT_decode_release
[D 01/12 09:01] lebf_decode_rel
[D 01/12 09:01] PINT_decode
[D 01/12 09:01] lebf_decode_req
[D 01/12 09:01] (0x8196580) create (prelude sm) state: req_sched
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x8196580:
create: request
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x8196580:
create: start
[D 01/12 09:01] (0x8196580) create (prelude sm) state: getattr_if_needed
[D 01/12 09:01] PVFS operation "create" got attr mask 127
(attr_uid_valid? yes, attr_owner = 0, credentials_uid = 8175)
(attr_gid_valid? yes, attr_group = 0, credentials.gid = 1001)
[D 01/12 09:01] Final permission check for "create" set error code to 0
[D 01/12 09:01] (0x8196580) create (prelude sm) state: perm_check
(status = 0)
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] [SYNC_COALESCE]: enqueue called
[D 01/12 09:01] [SYNC_COALESCE]: enqueue 1 counter sync:1 non_sync:0
[D 01/12 09:01] [SYNC_COALESCE]: dequeue called
[D 01/12 09:01] [SYNC_COALESCE]: dequeue 1 counter sync:0 non_sync:0
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(DSPACE_CREATE)
[D 01/12 09:01] [1 extents] -- new_handle is 1048578 (cur_extent is 4
- 2147483650)
[D 01/12 09:01] dbpf_attr_cache_insert: inserting 1048578 (b_size is 0)
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(DSPACE_CREATE) (ret: 1)
[D 01/12 09:01] [SYNC_COALESCE]: sync_coalesce called, handle: 0, cid: 0
[D 01/12 09:01] [SYNC_COALESCE]: sync_context: 0x80ef060
[D 01/12 09:01] [SYNC_COALESCE]: sync context type is DSPACE
[D 01/12 09:01] [SYNC_COALESCE]: high or low watermark reached:
coalesced: 0
queued: 0
[D 01/12 09:01] [SYNC_COALESCE]: syncing now!
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 1 sync start in
coalesce_queue:0 pending:0
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 1 sync stop
[D 01/12 09:01] [SYNC_COALESCE]: moving op 0xb3e09e30 with handle: 0
to completion queue
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] Handle created: 1048578
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x8196580:
create: new handle: 1048578, type metafile.
[D 01/12 09:01] (0x8196580) create (FR sm) state: release:
(error_code = 0)
[D 01/12 09:01] (0x8196580) create (FR sm) state: send_resp (status = 0)
[D 01/12 09:01] PINT_encode
[D 01/12 09:01] encode_common
[D 01/12 09:01] lebf_encode_resp
[D 01/12 09:01] BMI_post_send_list: addr: 178, count: 1, total_size:
24, tag: 4
[D 01/12 09:01] element 0: offset: 0xb58b4008, size: 24
[D 01/12 09:01] (0x8196580) create (FR sm) state: cleanup
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x8196580:
create: finish (Success)
[D 01/12 09:01] PINT_encode_release
[D 01/12 09:01] lebf_encode_rel
[D 01/12 09:01] PINT_decode_release
[D 01/12 09:01] lebf_decode_rel
[D 01/12 09:01] PINT_decode
[D 01/12 09:01] lebf_decode_req
[D 01/12 09:01] (0x81975c8) create (prelude sm) state: req_sched
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x81975c8:
create: request
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x81975c8:
create: start
[D 01/12 09:01] (0x81975c8) create (prelude sm) state: getattr_if_needed
[D 01/12 09:01] PVFS operation "create" got attr mask 127
(attr_uid_valid? yes, attr_owner = 0, credentials_uid = 8175)
(attr_gid_valid? yes, attr_group = 0, credentials.gid = 1001)
[D 01/12 09:01] Final permission check for "create" set error code to 0
[D 01/12 09:01] (0x81975c8) create (prelude sm) state: perm_check
(status = 0)
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] [SYNC_COALESCE]: enqueue called
[D 01/12 09:01] [SYNC_COALESCE]: enqueue 1 counter sync:1 non_sync:0
[D 01/12 09:01] [SYNC_COALESCE]: dequeue called
[D 01/12 09:01] [SYNC_COALESCE]: dequeue 1 counter sync:0 non_sync:0
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(DSPACE_CREATE)
[D 01/12 09:01] [1 extents] -- new_handle is 4294967291 (cur_extent
is 2147483651 - 4294967297)
[D 01/12 09:01] dbpf_attr_cache_insert: inserting 4294967291 (b_size
is 0)
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(DSPACE_CREATE) (ret: 1)
[D 01/12 09:01] [SYNC_COALESCE]: sync_coalesce called, handle: 0, cid: 0
[D 01/12 09:01] [SYNC_COALESCE]: sync_context: 0x80ef060
[D 01/12 09:01] [SYNC_COALESCE]: sync context type is DSPACE
[D 01/12 09:01] [SYNC_COALESCE]: high or low watermark reached:
coalesced: 0
queued: 0
[D 01/12 09:01] [SYNC_COALESCE]: syncing now!
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 1 sync start in
coalesce_queue:0 pending:0
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 1 sync stop
[D 01/12 09:01] [SYNC_COALESCE]: moving op 0xb3e09e30 with handle: 0
to completion queue
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] Handle created: 4294967291
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x81975c8:
create: new handle: 4294967291, type datafile.
[D 01/12 09:01] (0x81975c8) create (FR sm) state: release:
(error_code = 0)
[D 01/12 09:01] (0x81975c8) create (FR sm) state: send_resp (status = 0)
[D 01/12 09:01] PINT_encode
[D 01/12 09:01] encode_common
[D 01/12 09:01] lebf_encode_resp
[D 01/12 09:01] BMI_post_send_list: addr: 178, count: 1, total_size:
24, tag: 5
[D 01/12 09:01] element 0: offset: 0xb58b4008, size: 24
[D 01/12 09:01] (0x81975c8) create (FR sm) state: cleanup
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=0 S=0x81975c8:
create: finish (Success)
[D 01/12 09:01] PINT_encode_release
[D 01/12 09:01] lebf_encode_rel
[D 01/12 09:01] PINT_decode_release
[D 01/12 09:01] lebf_decode_rel
[D 01/12 09:01] PINT_decode
[D 01/12 09:01] lebf_decode_req
[D 01/12 09:01] (0x8198610) setattr (prelude sm) state: req_sched
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048578 S=0x8198610:
setattr: request
[D 01/12 09:01] REQ SCHED POSTING, handle: 1048578, queue_element:
0xb3e02be8
[D 01/12 09:01] REQ SCHED SCHEDULING, handle: 1048578, queue_element:
0xb3e02be8
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048578 S=0x8198610:
setattr: start
[D 01/12 09:01] (0x8198610) setattr (prelude sm) state:
getattr_if_needed
[D 01/12 09:01] About to retrieve attributes for handle 1048578
[D 01/12 09:01] dspace_getattr fast path attr cache hit on 1048578
(dfile_count=0 | dist_size=0 | data_size=0)
[D 01/12 09:01] PVFS operation "setattr" got attr mask 127
(attr_uid_valid? yes, attr_owner = 0, credentials_uid = 8175)
(attr_gid_valid? yes, attr_group = 0, credentials.gid = 1001)
[D 01/12 09:01] Final permission check for "setattr" set error code to 0
[D 01/12 09:01] (0x8198610) setattr (prelude sm) state: perm_check
(status = 0)
[D 01/12 09:01] attrs read from dspace:
[owner = 0, group = 0, perms = 0, type = 1]
[D 01/12 09:01] attrs read from request:
[owner = 8175, group = 1001, perms = 644, type = 1]
[D 01/12 09:01] handle 1048578 refers to a metafile
[D 01/12 09:01] *** dspace has dfile count 0 and req has dfile count 1
[D 01/12 09:01] *** dspace has dist size 0 and req has dist size 48
[D 01/12 09:01] *** using dfile_count of 1
[D 01/12 09:01] *** using dist_size of 48
[D 01/12 09:01] request has dfile_count of 1 | dspace has 1
[D 01/12 09:01] metafile has 1 datafiles associated with it
[D 01/12 09:01] writing dh [1048578,1318064247,len 8]
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] [SYNC_COALESCE]: enqueue called
[D 01/12 09:01] [SYNC_COALESCE]: enqueue 0 counter sync:0 non_sync:1
[D 01/12 09:01] [SYNC_COALESCE]: dequeue called
[D 01/12 09:01] [SYNC_COALESCE]: dequeue 0 counter sync:0 non_sync:0
[D 01/12 09:01] [DBPF THREAD]: [KEYVAL -1]: -17
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(KEYVAL_WRITE)
[D 01/12 09:01] dbpf_keyval_write_op_svc: handle: 1048578, key: dh
[D 01/12 09:01] keyval_db->put(handle= 1048578, key= dh (3)) size=11
[D 01/12 09:01] *** Trove KeyVal Write of dh
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048578
returned (num_elems=3)
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048578
returned (num_elems=3)
[D 01/12 09:01] Setting data 0xb3e02bd8 based on key 1048578 and
key_str dh (data_sz=8)
[D 01/12 09:01] *** cached keyval data written (key is dh)
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(KEYVAL_WRITE) (ret: 1)
[D 01/12 09:01] [SYNC_COALESCE]: sync_coalesce called, handle:
1048578, cid: 0
[D 01/12 09:01] [SYNC_COALESCE]: sync not needed, moving to
completion queue: 1048578
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] metafile distribution size = 48
[D 01/12 09:01] writing md [1048578,1318064247,len 48]
[D 01/12 09:01] op_queue add: 0xb3e0ef58
[D 01/12 09:01] [SYNC_COALESCE]: enqueue called
[D 01/12 09:01] [SYNC_COALESCE]: enqueue 0 counter sync:1 non_sync:0
[D 01/12 09:01] [SYNC_COALESCE]: dequeue called
[D 01/12 09:01] [SYNC_COALESCE]: dequeue 0 counter sync:0 non_sync:0
[D 01/12 09:01] [DBPF THREAD]: [KEYVAL -1]: -18
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(KEYVAL_WRITE)
[D 01/12 09:01] dbpf_keyval_write_op_svc: handle: 1048578, key: md
[D 01/12 09:01] keyval_db->put(handle= 1048578, key= md (3)) size=11
[D 01/12 09:01] *** Trove KeyVal Write of md
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048578
returned (num_elems=3)
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048578
returned (num_elems=3)
[D 01/12 09:01] ** CANNOT cache data written (key is md)
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(KEYVAL_WRITE) (ret: 1)
[D 01/12 09:01] [SYNC_COALESCE]: sync_coalesce called, handle:
1048578, cid: 0
[D 01/12 09:01] [SYNC_COALESCE]: sync_context: 0x80eef20
[D 01/12 09:01] [SYNC_COALESCE]: sync context type is KEYVAL
[D 01/12 09:01] [SYNC_COALESCE]: high or low watermark reached:
coalesced: 0
queued: 0
[D 01/12 09:01] [SYNC_COALESCE]: syncing now!
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 0 sync start in
coalesce_queue:0 pending:0
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 0 sync stop
[D 01/12 09:01] [SYNC_COALESCE]: moving op 0xb3e0ef58 with handle:
1048578 to completion queue
[D 01/12 09:01] op_queue add: 0xb3e0ef58
[D 01/12 09:01] request has dfile_count of 1 | dspace has 1
[D 01/12 09:01] writing count of 1 to disk
[D 01/12 09:01] WRITING attrs: [owner = 8175, group = 1001
perms = 644, type = 1, atime = 1168610468, mtime = 0
ctime = 1168610468 | dfile_count = 1 | dist_size = 48]
[D 01/12 09:01] op_queue add: 0xb3e0ef58
[D 01/12 09:01] [SYNC_COALESCE]: enqueue called
[D 01/12 09:01] [SYNC_COALESCE]: enqueue 1 counter sync:1 non_sync:0
[D 01/12 09:01] [SYNC_COALESCE]: dequeue called
[D 01/12 09:01] [SYNC_COALESCE]: dequeue 1 counter sync:0 non_sync:0
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(DSPACE_SETATTR)
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048578
returned (num_elems=3)
[D 01/12 09:01] Updating cached attributes for key 1048578
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(DSPACE_SETATTR) (ret: 1)
[D 01/12 09:01] [SYNC_COALESCE]: sync_coalesce called, handle:
1048578, cid: 0
[D 01/12 09:01] [SYNC_COALESCE]: sync_context: 0x80ef060
[D 01/12 09:01] [SYNC_COALESCE]: sync context type is DSPACE
[D 01/12 09:01] [SYNC_COALESCE]: high or low watermark reached:
coalesced: 0
queued: 0
[D 01/12 09:01] [SYNC_COALESCE]: syncing now!
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 1 sync start in
coalesce_queue:0 pending:0
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 1 sync stop
[D 01/12 09:01] [SYNC_COALESCE]: moving op 0xb3e0ef58 with handle:
1048578 to completion queue
[D 01/12 09:01] op_queue add: 0xb3e0ef58
[D 01/12 09:01] (0x8198610) setattr (FR sm) state: release:
(error_code = 0)
[D 01/12 09:01] REQ SCHED RELEASING, handle: 1048578, queue_element:
0xb3e02be8
[D 01/12 09:01] (0x8198610) setattr (FR sm) state: send_resp (status
= 0)
[D 01/12 09:01] PINT_encode
[D 01/12 09:01] encode_common
[D 01/12 09:01] lebf_encode_resp
[D 01/12 09:01] BMI_post_send_list: addr: 178, count: 1, total_size:
16, tag: 6
[D 01/12 09:01] element 0: offset: 0xb58b4008, size: 16
[D 01/12 09:01] (0x8198610) setattr (FR sm) state: cleanup
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048578 S=0x8198610:
setattr: finish (Success)
[D 01/12 09:01] PINT_encode_release
[D 01/12 09:01] lebf_encode_rel
[D 01/12 09:01] PINT_decode_release
[D 01/12 09:01] lebf_decode_rel
[D 01/12 09:01] PINT_decode
[D 01/12 09:01] lebf_decode_req
[D 01/12 09:01] (0x8199658) crdirent (prelude sm) state: req_sched
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x8199658:
crdirent: request
[D 01/12 09:01] REQ SCHED POSTING, handle: 1048576, queue_element:
0xb3e02ba0
[D 01/12 09:01] REQ SCHED SCHEDULING, handle: 1048576, queue_element:
0xb3e02ba0
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x8199658:
crdirent: start
[D 01/12 09:01] (0x8199658) crdirent (prelude sm) state:
getattr_if_needed
[D 01/12 09:01] About to retrieve attributes for handle 1048576
[D 01/12 09:01] dspace_getattr fast path attr cache hit on 1048576
(dfile_count=0 | dist_size=0 | data_size=0)
[D 01/12 09:01] PVFS operation "crdirent" got attr mask 127
(attr_uid_valid? yes, attr_owner = 8175, credentials_uid =
8175)
(attr_gid_valid? yes, attr_group = 1001, credentials.gid =
1001)
[D 01/12 09:01] - check_mode called ---
(uid=8175,gid=1001,access_type=2)
[D 01/12 09:01] - object attributes --- (uid=8175,gid=1001,mode=511)
[D 01/12 09:01] - checking if uid (8175) is root ...
[D 01/12 09:01] - no
[D 01/12 09:01] - checking if owner (8175) matches uid (8175)...
[D 01/12 09:01] - yes
[D 01/12 09:01] - checking if permissions (511) allows access type
(2) for user...
[D 01/12 09:01] - yes
[D 01/12 09:01] - check_mode called ---
(uid=8175,gid=1001,access_type=1)
[D 01/12 09:01] - object attributes --- (uid=8175,gid=1001,mode=511)
[D 01/12 09:01] - checking if uid (8175) is root ...
[D 01/12 09:01] - no
[D 01/12 09:01] - checking if owner (8175) matches uid (8175)...
[D 01/12 09:01] - yes
[D 01/12 09:01] - checking if permissions (511) allows access type
(1) for user...
[D 01/12 09:01] - yes
[D 01/12 09:01] Final permission check for "crdirent" set error code
to 0
[D 01/12 09:01] (0x8199658) crdirent (prelude sm) state: perm_check
(status = 0)
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x8199658:
crdirent: crdirent entry: test points to 1048578
[D 01/12 09:01] got crdirent for test (with handle 1048578) in 1048576
[D 01/12 09:01] *** Trove KeyVal Read of de
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048576
returned (num_elems=3)
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] [DBPF THREAD]: [KEYVAL -1]: -19
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(KEYVAL_READ)
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048576
returned (num_elems=3)
[D 01/12 09:01] ** CANNOT cache data retrieved (key is de)
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(KEYVAL_READ) (ret: 1)
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] writing new directory entry for test (handle =
1048578) to dirdata dspace 4
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] [SYNC_COALESCE]: enqueue called
[D 01/12 09:01] [SYNC_COALESCE]: enqueue 0 counter sync:1 non_sync:0
[D 01/12 09:01] [SYNC_COALESCE]: dequeue called
[D 01/12 09:01] [SYNC_COALESCE]: dequeue 0 counter sync:0 non_sync:0
[D 01/12 09:01] [DBPF THREAD]: [KEYVAL -1]: -20
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(KEYVAL_WRITE)
[D 01/12 09:01] dbpf_keyval_write_op_svc: handle: 4, key: test
[D 01/12 09:01] keyval_db->put(handle= 4, key= test (5)) size=13
[D 01/12 09:01] *** Trove KeyVal Write of test
[D 01/12 09:01] [DBPF KEYVAL]: handle_info count increment: handle:
4, value: 1
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(KEYVAL_WRITE) (ret: 1)
[D 01/12 09:01] [SYNC_COALESCE]: sync_coalesce called, handle: 4, cid: 0
[D 01/12 09:01] [SYNC_COALESCE]: sync_context: 0x80eef20
[D 01/12 09:01] [SYNC_COALESCE]: sync context type is KEYVAL
[D 01/12 09:01] [SYNC_COALESCE]: high or low watermark reached:
coalesced: 0
queued: 0
[D 01/12 09:01] [SYNC_COALESCE]: syncing now!
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 0 sync start in
coalesce_queue:0 pending:0
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 0 sync stop
[D 01/12 09:01] [SYNC_COALESCE]: moving op 0xb3e09e30 with handle: 4
to completion queue
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] [SYNC_COALESCE]: enqueue called
[D 01/12 09:01] [SYNC_COALESCE]: enqueue 1 counter sync:1 non_sync:0
[D 01/12 09:01] [SYNC_COALESCE]: dequeue called
[D 01/12 09:01] [SYNC_COALESCE]: dequeue 1 counter sync:0 non_sync:0
[D 01/12 09:01] [DBPF THREAD]: STARTING TROVE SERVICE ROUTINE
(DSPACE_SETATTR)
[D 01/12 09:01] dbpf_cache_elem_lookup: cache elem matching 1048576
returned (num_elems=3)
[D 01/12 09:01] Updating cached attributes for key 1048576
[D 01/12 09:01] [DBPF THREAD]: FINISHED TROVE SERVICE ROUTINE
(DSPACE_SETATTR) (ret: 1)
[D 01/12 09:01] [SYNC_COALESCE]: sync_coalesce called, handle:
1048576, cid: 0
[D 01/12 09:01] [SYNC_COALESCE]: sync_context: 0x80ef060
[D 01/12 09:01] [SYNC_COALESCE]: sync context type is DSPACE
[D 01/12 09:01] [SYNC_COALESCE]: high or low watermark reached:
coalesced: 0
queued: 0
[D 01/12 09:01] [SYNC_COALESCE]: syncing now!
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 1 sync start in
coalesce_queue:0 pending:0
[D 01/12 09:01] [SYNC_COALESCE]: coalesce 1 sync stop
[D 01/12 09:01] [SYNC_COALESCE]: moving op 0xb3e09e30 with handle:
1048576 to completion queue
[D 01/12 09:01] op_queue add: 0xb3e09e30
[D 01/12 09:01] (0x8199658) crdirent (FR sm) state: release:
(error_code = 0)
[D 01/12 09:01] REQ SCHED RELEASING, handle: 1048576, queue_element:
0xb3e02ba0
[D 01/12 09:01] (0x8199658) crdirent (FR sm) state: send_resp (status
= 0)
[D 01/12 09:01] PINT_encode
[D 01/12 09:01] encode_common
[D 01/12 09:01] lebf_encode_resp
[D 01/12 09:01] BMI_post_send_list: addr: 178, count: 1, total_size:
16, tag: 7
[D 01/12 09:01] element 0: offset: 0xb58b4008, size: 16
[D 01/12 09:01] (0x8199658) crdirent (FR sm) state: cleanup
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=1048576 S=0x8199658:
crdirent: finish (Success)
[D 01/12 09:01] PINT_encode_release
[D 01/12 09:01] lebf_encode_rel
[D 01/12 09:01] PINT_decode_release
[D 01/12 09:01] lebf_decode_rel
[D 01/12 09:01] PINT_decode
[D 01/12 09:01] lebf_decode_req
[D 01/12 09:01] (0x819a6a0) io (prelude sm) state: req_sched
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=4294967291
S=0x819a6a0: io: request
[D 01/12 09:01] REQ SCHED POSTING, handle: 4294967291, queue_element:
0xb3e02930
[D 01/12 09:01] REQ SCHED SCHEDULING, handle: 4294967291,
queue_element: 0xb3e02930
[A 01/12 09:01] [EMAIL PROTECTED]://fog34:0:0 H=4294967291
S=0x819a6a0: io: start
[D 01/12 09:01] (0x819a6a0) io (prelude sm) state: getattr_if_needed
[D 01/12 09:01] About to retrieve attributes for handle 4294967291
[D 01/12 09:01] dspace_getattr fast path attr cache hit on 4294967291
(dfile_count=0 | dist_size=0 | data_size=0)
[D 01/12 09:01] PVFS operation "io" got attr mask 127
(attr_uid_valid? yes, attr_owner = 0, credentials_uid = 8175)
(attr_gid_valid? yes, attr_group = 0, credentials.gid = 1001)
[D 01/12 09:01] Final permission check for "io" set error code to 0
[D 01/12 09:01] (0x819a6a0) io (prelude sm) state: perm_check (status
= 0)
[D 01/12 09:01] PINT_encode
[D 01/12 09:01] encode_common
[D 01/12 09:01] lebf_encode_resp
[D 01/12 09:01] BMI_post_send_list: addr: 178, count: 1, total_size:
24, tag: 8
[D 01/12 09:01] element 0: offset: 0xb58b4008, size: 24
[D 01/12 09:01] flow: fsize: 0, server_nr: 0, server_ct: 1
[D 01/12 09:01] file_req_offset: 0, aggregate_size: 52428800,
handle: 4294967291
[D 01/12 09:01] io_start_flow() issuing flow to write data.
[D 01/12 09:01] PINT_New_request_state
[D 01/12 09:01] flowproto posting 0xb3e0ff88
[D 01/12 09:01] flowproto-multiqueue forcing trove_write_callback_fn.
[D 01/12 09:01] flowproto-multiqueue trove_write_callback_fn,
error_code: 0, flow: 0xb3e0ff88.
[D 01/12 09:01]
=========================================================
[D 01/12 09:01] PINT_process_request
[D 01/12 09:01] tiling 52428801 copies
[D 01/12 09:01] Do seq of 0 ne 1 st 1 nb 0 ub 1 lb 0 as 1 co 0
[D 01/12 09:01] lvl 0 el 0 blk 0 by 0
Early expected message length 1048576 tag 8 match 0xe000000100000008
[D 01/12 09:01] to 0 ta 0 fi 52428800
[D 01/12 09:01] basic type or contiguous data
[D 01/12 09:01] Request disp
[D 01/12 09:01] contig_offset = 0 contig_size = 52428801
lvl_flag = 1
[D 01/12 09:01] calling distribute
[D 01/12 09:01] PINT_distribute
[D 01/12 09:01] of 0 sz 52428800 ix 0 sm 64 by 0 bm
1048576 fsz 0 exfl 1
[D 01/12 09:01] begin iteration loff: 0
[D 01/12 09:01] segment exceeds byte limit
[D 01/12 09:01] file being extended
[D 01/12 09:01] process a segment
[D 01/12 09:01] of 0 sz 1048576
[D 01/12 09:01] add a segment 0
[D 01/12 09:01] end iteration
[D 01/12 09:01] done with segments or bytes
[D 01/12 09:01] of 1048576 sz 51380224 sg 1
sm 64 by 1048576 bm 1048576
[D 01/12 09:01] next loff: 1048576 next poff:
1048576
[D 01/12 09:01] return value 1048576
[D 01/12 09:01] finished
[D 01/12 09:01] retval = 1048576
[D 01/12 09:01] exiting distribute returned less than expected
[D 01/12 09:01] done sg 1 sm 64 by 1048576 bm 1048576 ta 0 to
1048576 fo 52428800 eof 0
[D 01/12 09:01]
=========================================================
[D 01/12 09:01] BMI_post_recv: addr: 178, offset: 0xb59b5008, size:
1048576, tag: 8
[D 01/12 09:01] flowproto posted 0xb3e0ff88
[D 01/12 09:01] Job flows in progress (post time): 1
pvfs2-server: src/server/io.sm:357: io_check_write_completion:
Assertion `s_op->u.io.flow_done || s_op->u.io.positive_ack_done' failed.
_______________________________________________
Pvfs2-developers mailing list
Pvfs2-developers@beowulf-underground.org
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers