Summary: | sys-kernel/gentoo-sources-2.6.33, app-emulation/qemu-kvm-0.12.3 - software raid - buffer io errors | ||
---|---|---|---|
Product: | Gentoo Linux | Reporter: | Jochen Schlick <josch09> |
Component: | [OLD] Core system | Assignee: | Gentoo Kernel Bug Wranglers and Kernel Maintainers <kernel> |
Status: | RESOLVED FIXED | ||
Severity: | normal | ||
Priority: | High | ||
Version: | unspecified | ||
Hardware: | AMD64 | ||
OS: | Linux | ||
Whiteboard: | |||
Package list: | Runtime testing required: | --- | |
Attachments: |
emerge --info
2.6.33 gentoo sources - kernel config |
Description
Jochen Schlick
2010-04-03 03:06:44 UTC
*** Bug 312931 has been marked as a duplicate of this bug. ***

Further investigation shows that everything works as expected when I create the virtual hard disk file (15GBFile.raw) on an XFS filesystem that is not on a Linux RAID device (not on /dev/md1) but is on the same physical hard disks that are also used for the raid10 device /dev/md1.

Please attach your kernel .config and the output of emerge --info.

Created attachment 226539
emerge --info

Created attachment 226545
2.6.33 gentoo sources - kernel config
I hope this helps a little bit.

PS: I have copied the 15GBFile.raw file to the raid10 device and am using it as an additional hard disk for the qemu guest. No errors at the moment. Perhaps the problem involves sparse files (qemu raw file format): the raw file itself is on raid10/XFS and is expanded by the guest (with mkfs.ext4).

Update: it happens with 2.6.33-gentoo-r1 too, but I found this:
http://marc.info/?l=linux-raid&m=126802743419044&w=2
https://patchwork.kernel.org/patch/83932/
I think I will test the mentioned patches.

I can't try it out because the patch https://patchwork.kernel.org/patch/83932/ isn't applicable to this kernel. The build fails with:
drivers/md/dm-table.c: In function 'dm_set_device_limits':
drivers/md/dm-table.c:532: error: 'struct queue_limits' has no member named 'max_segments'
make[2]: *** [drivers/md/dm-table.o] Error 1
make[1]: *** [drivers/md] Error 2
make: *** [drivers] Error 2

Here are the results of my further tests. The problem does NOT depend on qemu/kvm or the virtual machine setup (as I initially thought). It is a problem with raid10 and the XFS file system. Copying 100GB of data from a non-RAID partition to the XFS/raid10 file system triggers this Buffer I/O error more than 100 times, and when I compare the files, they really are different. The combination of XFS and raid10 is unusable with a 2.6.33 kernel. Let's see what happens when I replace XFS with ext4 on the raid10 device...

It must be a problem with the combination of XFS and the underlying Linux raid10, and nothing else. I replaced XFS with an ext4 file system and, what a surprise, no errors at all. ext4 is not able to trigger this bug. I copied several hundred gigabytes between the non-RAID and raid10 partitions and got no errors in the logs and no differences between the files.

After two weeks of testing with different vanilla 2.6.34-rcX kernels and the current 2.6.34 kernel, I can say that the problem (XFS/RAID10) seems to be fixed in the 2.6.34 kernels. At least it is no longer reproducible.

Cool, Jochen!
I'll go ahead and close the bug, but if you feel that it should be reopened, please go ahead and reopen it.
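For anyone who wants to repeat the copy-and-compare test described above on another kernel, the following is a minimal sketch in Python, not the reporter's actual procedure. The function names (`sha256_of`, `copy_tree`, `verify_tree`) and the example paths are hypothetical. Copying and verifying are kept as separate steps on purpose: a comparison made right after the copy may be served from the page cache rather than from the RAID device, so it is probably worth remounting the target file system (or otherwise dropping caches) between the two steps.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_tree(src_dir, dst_dir):
    """Copy every regular file under src_dir to the same relative path in dst_dir."""
    src_dir, dst_dir = Path(src_dir), Path(dst_dir)
    for src in sorted(p for p in src_dir.rglob("*") if p.is_file()):
        dst = dst_dir / src.relative_to(src_dir)
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copyfile(src, dst)


def verify_tree(src_dir, dst_dir):
    """Return the relative paths whose copy is missing or differs from the source."""
    src_dir, dst_dir = Path(src_dir), Path(dst_dir)
    mismatches = []
    for src in sorted(p for p in src_dir.rglob("*") if p.is_file()):
        rel = src.relative_to(src_dir)
        dst = dst_dir / rel
        if not dst.is_file() or sha256_of(src) != sha256_of(dst):
            mismatches.append(rel.as_posix())
    return mismatches
```

A run might look like `copy_tree("/mnt/plain/data", "/mnt/raid10/data")`, followed by a remount of the target, then `print(verify_tree("/mnt/plain/data", "/mnt/raid10/data"))`. Both paths are placeholders. An empty list means no differences were found; checking the kernel log for Buffer I/O errors remains a separate, manual step.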