Re: [Users] HELP, ISCSI Freakin!
- Original Message - Forgot one thing, the paste bin link! http://pastebin.com/3t7E41JK Date: Wednesday, July 17, 2013 7:08 PM To: users users@ovirt.org Subject: HELP, ISCSI Freakin! Hey Guys, I am running oVirt 3.1; and I am getting the following in dmesg, and I have pasted my VG info below. We are running iSCSI luns and this is a node complaining in the log. Our equalogic storage was close to full; and this started happening. I have extended it, and it is thin provisioned; but I am not sure if a pvresize is needed. If so, can it be done while online? Trying Looking at your VG info you have at least 450GB free on every storage domain so free space doesn't seem like the issue. You have real I/O errors there. The problematic device is: /dev/mapper/36090a028108c2e4a2214151d00c9 And it probably belongs to vg: 2b93dc46-0db7-4a2a-a494-a294452c9d2f If this device/vg is no longer relevant then run dmsetup remove /dev/mapper/36090a028108c2e4a2214151d00c9 If that doesn't work then you can try running it with '-f' (force). This will remove the device from the system. If it is relevant, then you need to fix the problem and make sure the system has access to it. not to take down this cluster; several heavily used VMs on it. The VMs seem to be running ok for now; but this is filling up the logs pretty quickly; as it appears to be retrying to connections and to release the resources. All help is appreciated . Thanks for all your help guys; and feel free to email me directly if I can provide any more info. Log Snippet: Thread-550892::DEBUG::2013-07-17 19:06:14,335::resourceManager::565::ResourceManager::(releaseResource) No one is waiting for resource 'Storage.69d66cf5-9426-4385-be6c-c55e2bd40617', Clearing records. Thread-550892::DEBUG::2013-07-17 19:06:14,335::task::978::TaskManager.Task::(_decref) Task=`03810e67-5dab-4193-8cbd-c633c8c25fa4`::ref 0 aborting False Thread-550892::DEBUG::2013-07-17 19:06:14,336::task::588::TaskManager.Task::(_updateState) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::moving from state init - state preparing Thread-550892::INFO::2013-07-17 19:06:14,336::logUtils::37::dispatcher::(wrapper) Run and protect: getVolumeSize(sdUUID='7cf8a585-34a8-470c-9b40-9d486456fcea', spUUID='2ecfa6dd-a1fa-428c-9d04-db17c6c4491d', imgUUID='8cbc183a-0afe-497a-a54b-ec5f03b7fe16', volUUID='bcb8d500-5aa4-404a-9be2-9fb84bc87300', options=None) Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::175::ResourceManager.Request::(__init__) ResName=`Storage.7cf8a585-34a8-470c-9b40-9d486456fcea`ReqID=`7411edba-20b8-4855-8271-b6aa52a9203a`::Request was made in '/usr/share/vdsm/storage/resourceManager.py' line '485' at 'registerResource' Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::486::ResourceManager::(registerResource) Trying to register resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' for lock type 'shared' Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::528::ResourceManager::(registerResource) Resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' is free. Now locking as 'shared' (1 active user) Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::212::ResourceManager.Request::(grant) ResName=`Storage.7cf8a585-34a8-470c-9b40-9d486456fcea`ReqID=`7411edba-20b8-4855-8271-b6aa52a9203a`::Granted request Thread-550892::DEBUG::2013-07-17 19:06:14,337::task::817::TaskManager.Task::(resourceAcquired) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::_resourcesAcquired: Storage.7cf8a585-34a8-470c-9b40-9d486456fcea (shared) Thread-550892::DEBUG::2013-07-17 19:06:14,337::task::978::TaskManager.Task::(_decref) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::ref 1 aborting False Thread-550892::INFO::2013-07-17 19:06:14,338::logUtils::39::dispatcher::(wrapper) Run and protect: getVolumeSize, Return response: {'truesize': '31138512896', 'apparentsize': '31138512896'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::task::1172::TaskManager.Task::(prepare) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::finished: {'truesize': '31138512896', 'apparentsize': '31138512896'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::task::588::TaskManager.Task::(_updateState) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::moving from state preparing - state finished Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::809::ResourceManager.Owner::(releaseAll) Owner.releaseAll requests {} resources {'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea': ResourceRef 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea', isValid: 'True' obj: 'None'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::844::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {} Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::538::ResourceManager::(releaseResource) Trying to release resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea'
Re: [Users] HELP, ISCSI Freakin!
Forgot one thing, the paste bin link! http://pastebin.com/3t7E41JK Date: Wednesday, July 17, 2013 7:08 PM To: users users@ovirt.orgmailto:users@ovirt.org Subject: HELP, ISCSI Freakin! Hey Guys, I am running oVirt 3.1; and I am getting the following in dmesg, and I have pasted my VG info below. We are running iSCSI luns and this is a node complaining in the log. Our equalogic storage was close to full; and this started happening. I have extended it, and it is thin provisioned; but I am not sure if a pvresize is needed. If so, can it be done while online? Trying not to take down this cluster; several heavily used VMs on it. The VMs seem to be running ok for now; but this is filling up the logs pretty quickly; as it appears to be retrying to connections and to release the resources. All help is appreciated. Thanks for all your help guys; and feel free to email me directly if I can provide any more info. Log Snippet: Thread-550892::DEBUG::2013-07-17 19:06:14,335::resourceManager::565::ResourceManager::(releaseResource) No one is waiting for resource 'Storage.69d66cf5-9426-4385-be6c-c55e2bd40617', Clearing records. Thread-550892::DEBUG::2013-07-17 19:06:14,335::task::978::TaskManager.Task::(_decref) Task=`03810e67-5dab-4193-8cbd-c633c8c25fa4`::ref 0 aborting False Thread-550892::DEBUG::2013-07-17 19:06:14,336::task::588::TaskManager.Task::(_updateState) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::moving from state init - state preparing Thread-550892::INFO::2013-07-17 19:06:14,336::logUtils::37::dispatcher::(wrapper) Run and protect: getVolumeSize(sdUUID='7cf8a585-34a8-470c-9b40-9d486456fcea', spUUID='2ecfa6dd-a1fa-428c-9d04-db17c6c4491d', imgUUID='8cbc183a-0afe-497a-a54b-ec5f03b7fe16', volUUID='bcb8d500-5aa4-404a-9be2-9fb84bc87300', options=None) Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::175::ResourceManager.Request::(__init__) ResName=`Storage.7cf8a585-34a8-470c-9b40-9d486456fcea`ReqID=`7411edba-20b8-4855-8271-b6aa52a9203a`::Request was made in '/usr/share/vdsm/storage/resourceManager.py' line '485' at 'registerResource' Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::486::ResourceManager::(registerResource) Trying to register resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' for lock type 'shared' Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::528::ResourceManager::(registerResource) Resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' is free. Now locking as 'shared' (1 active user) Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::212::ResourceManager.Request::(grant) ResName=`Storage.7cf8a585-34a8-470c-9b40-9d486456fcea`ReqID=`7411edba-20b8-4855-8271-b6aa52a9203a`::Granted request Thread-550892::DEBUG::2013-07-17 19:06:14,337::task::817::TaskManager.Task::(resourceAcquired) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::_resourcesAcquired: Storage.7cf8a585-34a8-470c-9b40-9d486456fcea (shared) Thread-550892::DEBUG::2013-07-17 19:06:14,337::task::978::TaskManager.Task::(_decref) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::ref 1 aborting False Thread-550892::INFO::2013-07-17 19:06:14,338::logUtils::39::dispatcher::(wrapper) Run and protect: getVolumeSize, Return response: {'truesize': '31138512896', 'apparentsize': '31138512896'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::task::1172::TaskManager.Task::(prepare) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::finished: {'truesize': '31138512896', 'apparentsize': '31138512896'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::task::588::TaskManager.Task::(_updateState) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::moving from state preparing - state finished Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::809::ResourceManager.Owner::(releaseAll) Owner.releaseAll requests {} resources {'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea': ResourceRef 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea', isValid: 'True' obj: 'None'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::844::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {} Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::538::ResourceManager::(releaseResource) Trying to release resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::553::ResourceManager::(releaseResource) Released resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' (0 active users) Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::558::ResourceManager::(releaseResource) Resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' is free, finding out if anyone is waiting for it. Thread-550892::DEBUG::2013-07-17 19:06:14,339::resourceManager::565::ResourceManager::(releaseResource) No one is waiting for resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea', Clearing records. Thread-550892::DEBUG::2013-07-17
[Users] HELP, ISCSI Freakin!
Hey Guys, I am running oVirt 3.1; and I am getting the following in dmesg, and I have pasted my VG info below. We are running iSCSI luns and this is a node complaining in the log. Our equalogic storage was close to full; and this started happening. I have extended it, and it is thin provisioned; but I am not sure if a pvresize is needed. If so, can it be done while online? Trying not to take down this cluster; several heavily used VMs on it. The VMs seem to be running ok for now; but this is filling up the logs pretty quickly; as it appears to be retrying to connections and to release the resources. All help is appreciated. Thanks for all your help guys; and feel free to email me directly if I can provide any more info. Log Snippet: Thread-550892::DEBUG::2013-07-17 19:06:14,335::resourceManager::565::ResourceManager::(releaseResource) No one is waiting for resource 'Storage.69d66cf5-9426-4385-be6c-c55e2bd40617', Clearing records. Thread-550892::DEBUG::2013-07-17 19:06:14,335::task::978::TaskManager.Task::(_decref) Task=`03810e67-5dab-4193-8cbd-c633c8c25fa4`::ref 0 aborting False Thread-550892::DEBUG::2013-07-17 19:06:14,336::task::588::TaskManager.Task::(_updateState) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::moving from state init - state preparing Thread-550892::INFO::2013-07-17 19:06:14,336::logUtils::37::dispatcher::(wrapper) Run and protect: getVolumeSize(sdUUID='7cf8a585-34a8-470c-9b40-9d486456fcea', spUUID='2ecfa6dd-a1fa-428c-9d04-db17c6c4491d', imgUUID='8cbc183a-0afe-497a-a54b-ec5f03b7fe16', volUUID='bcb8d500-5aa4-404a-9be2-9fb84bc87300', options=None) Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::175::ResourceManager.Request::(__init__) ResName=`Storage.7cf8a585-34a8-470c-9b40-9d486456fcea`ReqID=`7411edba-20b8-4855-8271-b6aa52a9203a`::Request was made in '/usr/share/vdsm/storage/resourceManager.py' line '485' at 'registerResource' Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::486::ResourceManager::(registerResource) Trying to register resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' for lock type 'shared' Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::528::ResourceManager::(registerResource) Resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' is free. Now locking as 'shared' (1 active user) Thread-550892::DEBUG::2013-07-17 19:06:14,336::resourceManager::212::ResourceManager.Request::(grant) ResName=`Storage.7cf8a585-34a8-470c-9b40-9d486456fcea`ReqID=`7411edba-20b8-4855-8271-b6aa52a9203a`::Granted request Thread-550892::DEBUG::2013-07-17 19:06:14,337::task::817::TaskManager.Task::(resourceAcquired) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::_resourcesAcquired: Storage.7cf8a585-34a8-470c-9b40-9d486456fcea (shared) Thread-550892::DEBUG::2013-07-17 19:06:14,337::task::978::TaskManager.Task::(_decref) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::ref 1 aborting False Thread-550892::INFO::2013-07-17 19:06:14,338::logUtils::39::dispatcher::(wrapper) Run and protect: getVolumeSize, Return response: {'truesize': '31138512896', 'apparentsize': '31138512896'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::task::1172::TaskManager.Task::(prepare) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::finished: {'truesize': '31138512896', 'apparentsize': '31138512896'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::task::588::TaskManager.Task::(_updateState) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::moving from state preparing - state finished Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::809::ResourceManager.Owner::(releaseAll) Owner.releaseAll requests {} resources {'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea': ResourceRef 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea', isValid: 'True' obj: 'None'} Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::844::ResourceManager.Owner::(cancelAll) Owner.cancelAll requests {} Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::538::ResourceManager::(releaseResource) Trying to release resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::553::ResourceManager::(releaseResource) Released resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' (0 active users) Thread-550892::DEBUG::2013-07-17 19:06:14,338::resourceManager::558::ResourceManager::(releaseResource) Resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea' is free, finding out if anyone is waiting for it. Thread-550892::DEBUG::2013-07-17 19:06:14,339::resourceManager::565::ResourceManager::(releaseResource) No one is waiting for resource 'Storage.7cf8a585-34a8-470c-9b40-9d486456fcea', Clearing records. Thread-550892::DEBUG::2013-07-17 19:06:14,339::task::978::TaskManager.Task::(_decref) Task=`7379fc72-5137-4432-861c-85b1f33ba843`::ref 0 aborting False Dmesg Snippet: __ratelimit: 28 callbacks suppressed end_request: I/O error, dev dm-64, sector 5769088 end_request: