git.proxmox.com Git - mirror_zfs-debian.git/log

]> git.proxmox.com Git - mirror_zfs-debian.git/log

projects / mirror_zfs-debian.git / log

commit | commitdiff | tree

Darik Horn [Fri, 13 Apr 2012 14:06:31 +0000 (09:06 -0500)]

Merge branch 'upstream'

commit | commitdiff | tree

P.SCH [Mon, 9 Apr 2012 06:05:54 +0000 (09:05 +0300)]

ZFS list snapshot property alias

Add support for the `zfs list -t snap` alias which is available under
Oracle Solaris 11.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #640

commit | commitdiff | tree

P.SCH [Thu, 5 Apr 2012 13:03:21 +0000 (16:03 +0300)]

ZFS snapshot alias

For consistency, and because it's handy, add the 'zfs snap' alias which
was introduced by Oracle Solaris 11. This includes an update to the
man page to reflect all the available alias (snap, umount, and recv).

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #640

commit | commitdiff | tree

Martin Matuska [Sun, 8 Apr 2012 17:16:46 +0000 (13:16 -0400)]

Illumos #1346: zfs incremental receive may leave behind temporary clones

1356 zfs dataset prefetch code not working
Reviewed by: Matthew Ahrens <matt@delphix.com>
Reviewed by: Dan McDonald <danmcd@nexenta.com>
Approved by: Gordon Ross <gwr@nexenta.com>

References to Illumos issue:
https://www.illumos.org/issues/1346
https://www.illumos.org/issues/1356

Ported-by: Richard Yao <ryao@cs.stonybrook.edu>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #647

commit | commitdiff | tree

Albert Lee [Sun, 8 Apr 2012 17:10:49 +0000 (13:10 -0400)]

Illumos #1475: zfs spill block hold can access invalid spill blkptr

Reviewed by: Dan McDonald <danmcd@nexenta.com>
Reviewed by: Gordon Ross <gwr@nexenta.com>
Reviewed by: Matthew Ahrens <mahrens@delphix.com>
Reviewed by: George Wilson <gwilson@zfsmail.com>
Approved by: Garrett D'Amore <garrett@nexenta.com>

References to Illumos issue:
https://www.illumos.org/issues/1475

Ported-by: Richard Yao <ryao@cs.stonybrook.edu>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #648

commit | commitdiff | tree

George Wilson [Sun, 8 Apr 2012 17:23:08 +0000 (13:23 -0400)]

Illumos #1951: leaking a vdev when removing an l2cache device

1952 memory leak when adding a file-based l2arc device
1954 leak in ZFS from metaslab_group_create and zfs_ereport_checksum

Reviewed by: Adam Leventhal <ahl@delphix.com>
Reviewed by: Matt Ahrens <matt@delphix.com>
Reviewed by: Eric Schrock <eric.schrock@delphix.com>
Reviewed by: Bill Pijewski <wdp@joyent.com>
Reviewed by: Dan McDonald <danmcd@nexenta.com>
Approved by: Eric Schrock <eric.schrock@delphix.com>

References to Illumos issues:
  https://www.illumos.org/issues/1951
  https://www.illumos.org/issues/1952
  https://www.illumos.org/issues/1954

Ported-by: Richard Yao <ryao@cs.stonybrook.edu>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #650

commit | commitdiff | tree

Martin Matuska [Sun, 8 Apr 2012 17:18:48 +0000 (13:18 -0400)]

OS-926: zfs panic in zfs_fill_zplprops_impl()

This change appears to be exclusive to SmartOS. It is not present in
illumos-gate but it just adds some needed error handling. This is
clearly preferable to simply ASSERTING which is what would occur
prior to the patch.

Reviewed by: Jerry Jelinek <jerry.jelinek@joyent.com>
Reviewed by: Matt Ahrens <matt@delphix.com>
Ported-by: Richard Yao <ryao@cs.stonybrook.edu>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #652

commit | commitdiff | tree

Andriy Gapon [Mon, 9 Apr 2012 17:01:46 +0000 (13:01 -0400)]

Illumos #1680: zfs vdev_file_io_start: validate vdev before using vdev_tsd

vdev_tsd can be NULL for certain vdev states.
At least in userland testing with ztest.

References to Illumos issue:
https://www.illumos.org/issues/1680

Ported-by: Richard Yao <ryao@cs.stonybrook.edu>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #655

commit | commitdiff | tree

Richard Laager [Mon, 9 Apr 2012 21:59:37 +0000 (16:59 -0500)]

Improve error message consistency

Signed-off-by: Richard Laager <rlaager@wiktel.com>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Richard Laager [Wed, 11 Apr 2012 01:03:26 +0000 (20:03 -0500)]

Document the zle compression algorithm

Signed-off-by: Richard Laager <rlaager@wiktel.com>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Brian Behlendorf [Wed, 4 Apr 2012 20:46:55 +0000 (13:46 -0700)]

Export additional dsl symbols

Principly these symbols were exported to get access to the
dsl_prop_register/dsl_prop_unregister functions. They allow
us to cleanly register a callback which is called when a
dataset property is modified.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Darik Horn [Mon, 9 Apr 2012 14:50:42 +0000 (09:50 -0500)]

PPA 0.6.0.58-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Mon, 9 Apr 2012 14:48:52 +0000 (09:48 -0500)]

Add dkms kernel source exception for Proxmox 2.0

Proxmox 2.0 identifies itself as Debian, but provides a non-Debian
kernel package. Accomodate the configuration difference in the
dkms.conf file.

Closes: zfsonlinux/zfs#630

commit | commitdiff | tree

Darik Horn [Mon, 9 Apr 2012 14:45:48 +0000 (09:45 -0500)]

Refresh debian/patches after upstream merge.

commit | commitdiff | tree

Darik Horn [Mon, 9 Apr 2012 14:43:05 +0000 (09:43 -0500)]

Merge branch 'upstream'

commit | commitdiff | tree

Gunnar Beutner [Thu, 5 Apr 2012 08:30:10 +0000 (10:30 +0200)]

Fixed a NULL pointer dereference bug in zfs_preumount

When zpl_fill_super -> zfs_domount fails (e.g. because the dataset
was destroyed before it could be successfully mounted) the subsequent
call to zpl_kill_sb -> zfs_preumount would derefence a NULL pointer.

This bug can be reproduced using this shell script:

#!/bin/sh
(
while true; do
zfs create -o mountpoint=legacz tank/bar
zfs destroy tank/bar
done
) &

(
while true; do
mount -t zfs tank/bar /mnt
umount /mnt
done
) &

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #639

commit | commitdiff | tree

Richard Yao [Tue, 3 Apr 2012 04:21:22 +0000 (00:21 -0400)]

Make Gentoo initscript use modinfo

The -l parameter to modprobe has been removed from the latest upstream
code and this change has entered Gentoo. Using modinfo as a substitute
addresses this.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #636

commit | commitdiff | tree

Richard Yao [Tue, 3 Apr 2012 00:44:42 +0000 (20:44 -0400)]

Print human readable error message for ENOENT

A cryptic error code is printed when mounting a legacy dataset to a
non-existent mountpoint. This patch changes this behavior to print
"mount point '%s' does not exist", which is similar to the error
message printed when mounting procfs.

The single quotes were added to be consistent with the existing EBUSY
error message, which is the only difference between this error message
and the one that is printed when the same condition occurs when mounting
procfs.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #633

commit | commitdiff | tree

Darik Horn [Tue, 3 Apr 2012 04:07:10 +0000 (23:07 -0500)]

PPA 0.6.0.57-0ubuntu2 release.

commit | commitdiff | tree

Darik Horn [Tue, 3 Apr 2012 03:53:59 +0000 (22:53 -0500)]

Reorder zfs-dkms.postinst for dh_dkms glitch.

The #DEBHELPER# macro produces code for DKMS that can cause sporadic
failures if it is not run last. Identical code is executed differently
depending on whether it is before or after the template expansion.

Closes: zfsonlinux/zfs#629

commit | commitdiff | tree

Darik Horn [Sat, 31 Mar 2012 00:06:12 +0000 (19:06 -0500)]

Merge branch 'upstream'

commit | commitdiff | tree

Darik Horn [Fri, 30 Mar 2012 05:39:41 +0000 (00:39 -0500)]

PPA 0.6.0.57-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Fri, 30 Mar 2012 05:37:35 +0000 (00:37 -0500)]

Enable 0007-Integrate-ARC-more-tightly-with-Linux.patch

commit | commitdiff | tree

Darik Horn [Fri, 30 Mar 2012 05:36:50 +0000 (00:36 -0500)]

Refresh debian/patches after upstream merge.

commit | commitdiff | tree

Darik Horn [Fri, 30 Mar 2012 04:53:12 +0000 (23:53 -0500)]

Add missing #DEBHELPER# macro to zfsutils.

The #DEBHELPER# macro was missing from the zfsutils.postinst file,
which caused a failure to register the /etc/init.d/zfs-mount and
/etc/init.d/zfs-share scripts through the update-rc.d utility.

For completeness, also add the #DEBHELPER# macro to the existing
zfsutils.postrm file, and create a zfsutils.prerm file for it too.

Closes: dajhorn/pkg-zfs#28

commit | commitdiff | tree

Darik Horn [Fri, 30 Mar 2012 03:04:01 +0000 (22:04 -0500)]

Add libgcc_s.so.1 to the zfs-initramfs manifest.

The `zdb` utility requires pthreads for some functions, but the
necessary library is not automatically added to the initrd because
it is not detected as a dependency.

Closes: dajhorn/pkg-zfs#26

commit | commitdiff | tree

Darik Horn [Fri, 30 Mar 2012 02:31:48 +0000 (21:31 -0500)]

Invoke notify-reboot-required for upgrades.

Call `/usr/share/update-notifier/notify-reboot-required` in the
zfs-dkms postinst script to advise the system that kernel module
upgrades are activated by a reboot.

Closes: dajhorn/pkg-zfs#25

commit | commitdiff | tree

Brian Behlendorf [Tue, 27 Mar 2012 17:10:26 +0000 (10:10 -0700)]

Properly expose the mfu ghost list kstats

Due to a typo the mru ghost lists stats were accidentally being
exposed as the mfu ghost list stats. This was harmless but
confusing since memory usage could be over reported.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Craig Sanders [Tue, 27 Mar 2012 10:37:41 +0000 (21:37 +1100)]

Remove hard-coded 80 column output

When stdout is detected to be a tty use the number of columns
specified by the terminal. If that fails fall back to a default
80 column width. In the non-tty case allow for 999 column lines.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Darik Horn [Tue, 27 Mar 2012 02:20:42 +0000 (21:20 -0500)]

PPA 0.6.0.56-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Tue, 27 Mar 2012 02:19:03 +0000 (21:19 -0500)]

Merge branch 'upstream'

commit | commitdiff | tree

Darik Horn [Tue, 27 Mar 2012 02:17:43 +0000 (21:17 -0500)]

Disable 0007-Integrate-ARC-more-tightly-with-Linux.patch

This patch is experimental, so disable it for general release.

commit | commitdiff | tree

Brian Behlendorf [Mon, 26 Mar 2012 18:55:16 +0000 (11:55 -0700)]

ZFS 0.6.0-rc8

commit | commitdiff | tree

Brian Behlendorf [Mon, 26 Mar 2012 17:34:55 +0000 (10:34 -0700)]

Fix executable permissions

Caught by lint, this permission change was accidentally introduced
by commit 42cb3819f1a1f536105faac81ffc150f3da90a80. Restore the
correct permissions and while I'm at it add a missing whack-bang
to config/ltmain.sh.

lint: executable-not-elf-or-script: zpool_main.c zfs_main.c

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #620

commit | commitdiff | tree

Darik Horn [Sat, 24 Mar 2012 01:38:45 +0000 (20:38 -0500)]

PPA 0.6.0.55-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Sat, 24 Mar 2012 01:37:24 +0000 (20:37 -0500)]

Refresh debian/patches after upstream merge.

commit | commitdiff | tree

Darik Horn [Sat, 24 Mar 2012 01:36:52 +0000 (20:36 -0500)]

Merge branch 'upstream'

commit | commitdiff | tree

Brian Behlendorf [Tue, 20 Mar 2012 23:00:17 +0000 (16:00 -0700)]

Add --enable-debug-dmu-tx configure option

Allow rigorous (and expensive) tx validation to be enabled/disabled
indepentantly from the standard zfs debugging.  When enabled these
checks ensure that all txs are constructed properly and that a dbuf
is never dirtied without taking the correct tx hold.

This checking is particularly helpful when adding new dmu consumers
like Lustre.  However, for established consumers such as the zpl
with no known outstanding tx construction problems this is just
overhead.

--enable-debug-dmu-tx  - Enable/disable validation of each tx as
--disable-debug-dmu-tx   it is constructed.  By default validation
                         is disabled due to performance concerns.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Brian Behlendorf [Mon, 12 Mar 2012 19:38:00 +0000 (12:38 -0700)]

Enhance a dmu_tx_dirty_buf() assertion

The following assertion is good to validate the correctness of
new DMU consumers, but it doesn't quite provide enough information.
Slightly rework the assertion so that when it is hit the actual
offending values will be included in the output.

SPLError: 4787:0:(dmu_tx.c:828:dmu_tx_dirty_buf())
ASSERTION(dn == NULL || dn->dn_assigned_txg == tx->tx_txg) failed

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Brian Behlendorf [Fri, 23 Mar 2012 18:59:47 +0000 (11:59 -0700)]

Add ZFS_META_RELEASE to module load/unload messages

Include the ZFS_META_RELEASE in the module load/unload messages
to more clearly indidcate exactly what version of ZFS has been
loaded.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Brian Behlendorf [Thu, 22 Mar 2012 21:47:29 +0000 (14:47 -0700)]

Account for .zfs ctldir inodes

Because the .zfs ctldir inodes are not backed by physical storage
they use a different create path which was not properly accounting
for them as used. This could result in ->nr_cached_objects()
returning 0 and cause a divide by zero error in prune_super().

In my option there's a kernel bug here too which allows this to
happen. They should either be checking for 0 or adding +1 like
they correctly do earlier in the function.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #617

commit | commitdiff | tree

Brian Behlendorf [Fri, 11 Nov 2011 07:15:53 +0000 (12:45 +0530)]

Add .zfs control directory

Add support for the .zfs control directory.  This was accomplished
by leveraging as much of the existing ZFS infrastructure as posible
and updating it for Linux as required.  The bulk of the core
functionality is now all there with the following limitations.

*) The .zfs/snapshot directory automount support requires a 2.6.37
   or newer kernel.  The exception is RHEL6.2 which has backported
   the d_automount patches.

*) Creating/destroying/renaming snapshots with mkdir/rmdir/mv
   in the .zfs/snapshot directory works as expected.  However,
   this functionality is only available to root until zfs
   delegations are finished.

      * mkdir - create a snapshot
      * rmdir - destroy a snapshot
      * mv    - rename a snapshot

The following issues are known defeciences, but we expect them to
be addressed by future commits.

*) Add automount support for kernels older the 2.6.37.  This should
   be possible using follow_link() which is what Linux did before.

*) Accessing the .zfs/snapshot directory via NFS is not yet possible.
   The majority of the ground work for this is complete.  However,
   finishing this work will require resolving some lingering
   integration issues with the Linux NFS kernel server.

*) The .zfs/shares directory exists but no futher smb functionality
   has yet been implemented.

Contributions-by: Rohan Puri <rohan.puri15@gmail.com>
Contributiobs-by: Andrew Barnes <barnes333@gmail.com>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #173

commit | commitdiff | tree

Darik Horn [Wed, 21 Mar 2012 23:27:13 +0000 (18:27 -0500)]

Add: Add-.zfs-control-directory.patch

Add support for the .zfs control directory.  This was accomplished
by leveraging as much of the existing ZFS infrastructure as posible
and updating it for Linux as required.  The bulk of the core
functionality is now all there with the following limitations.

*) The .zfs/snapshot directory automount support requires a 2.6.37
   or newer kernel.  The exception is RHEL6.2 which has backported
   the d_automount patches.

*) Creating/destroying/renaming snapshots with mkdir/rmdir/mv
   in the .zfs/snapshot directory works as expected.  However,
   this functionality is only available to root until zfs
   delegations are finished.

      * mkdir - create a snapshot
      * rmdir - destroy a snapshot
      * mv    - rename a snapshot

The following issues are known defeciences, but we expect them to
be addressed by future commits.

*) Add automount support for kernels older the 2.6.37.  This should
   be possible using follow_link() which is what Linux did before.

*) Accessing the the .zfs/snapshot directory via NFS is not yet
   possible.  The majority of the ground work for this is complete.
   However, finishing this work will require resolving some lingering
   integration issues with the Linux NFS kernel server.

*) The .zfs/shares directory exists but no futher smb functionality
   has yet been implemented.

Contributions-by: Rohan Puri <rohan.puri15@gmail.com>
Contributiobs-by: Andrew Barnes <barnes333@gmail.com>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #173

commit | commitdiff | tree

Darik Horn [Wed, 21 Mar 2012 23:08:19 +0000 (18:08 -0500)]

Merge branch 'master-0.6.7-rc7.1'

commit | commitdiff | tree

Darik Horn [Wed, 21 Mar 2012 22:44:43 +0000 (17:44 -0500)]

PPA 0.6.0.54-1ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Wed, 21 Mar 2012 22:42:05 +0000 (17:42 -0500)]

Merge branch 'upstream' into master-0.6.7-rc7.1

commit | commitdiff | tree

Brian Behlendorf [Thu, 23 Feb 2012 23:32:51 +0000 (15:32 -0800)]

Add zio constructor/destructor

Add a standard zio constructor and destructor.  Normally, this is
done to reduce to cost of allocating a new structure by reducing
expensive operations such as memory allocations.  However, in this
case none of the operations moved out of zio_create() were really
very expensive.

This change was principly made as a debug patch (and workaround)
for a zio_destroy() race.  The is good evidence that zio_create()
is reinitializing a mutex which is really still in use by another
thread.  This would completely explain the observed symptoms in
the issue report.

This patch doesn't fix the root cause of the race, but it should
make it less likely by only initializing the mutex once in the
constructor.  Also, this particular flaw might have gone unnoticed
in other zfs implementations due to the specific implementation
details of Linux ticket spinlocks.

Once the real root cause is determined and resolved this change
can be safely reverted.  Until then this should help workaround
the issue.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #496

commit | commitdiff | tree

Brian Behlendorf [Wed, 21 Mar 2012 18:51:27 +0000 (11:51 -0700)]

Revert "Add zio constructor/destructor"

This patch was slightly flawed and allowed for zio->io_logical
to potentially not be reinitialized for a new zio.  This could
lead to assertion failures in specific cases when debugging is
enabled (--enable-debug) and I/O errors are encountered.  It
may also have caused problems when issues logical I/Os.

Since we want to make sure this workaround can be easily removed
in the future (when we have the real fix).  I'm reverting this
change and applying a new version of the patch which includes
the zio->io_logical fix.

This reverts commit 2c6d0b1e07b0265f0661ed7851d3aa8d3e75e7a9.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #602
Issue #604

commit | commitdiff | tree

Darik Horn [Sat, 17 Mar 2012 05:39:36 +0000 (00:39 -0500)]

Add 0007-Integrate-ARC-more-tightly-with-Linux.patch

commit | commitdiff | tree

Darik Horn [Sat, 17 Mar 2012 04:41:43 +0000 (23:41 -0500)]

PPA 0.6.0.54-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Sat, 17 Mar 2012 04:40:31 +0000 (23:40 -0500)]

Merge branch 'upstream'

commit | commitdiff | tree

Brian Behlendorf [Fri, 16 Mar 2012 18:11:40 +0000 (11:11 -0700)]

ZFS 0.6.0-rc7

commit | commitdiff | tree

Brian Behlendorf [Wed, 14 Mar 2012 19:36:49 +0000 (12:36 -0700)]

Add missing NULL in zpl_xattr_handlers

The xattr_resolve_name() helper function expects the registered
list of xattr handlers to be NULL terminated. This NULL was
accidentally missing which could result in a NULL dereference.

Interestingly this issue only manifested itself on certain 32-bit
systems. Presumably on 64-bit kernels we just always happen to
get lucky and the memory following the structure is zeroed.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #594

commit | commitdiff | tree

Gregor Kopka [Wed, 14 Mar 2012 13:34:00 +0000 (14:34 +0100)]

Use stderr for 'no pools/datasets available' error

The 'zfs list' and 'zpool list' commands output the message
'no datasets/pools available' to stdout. This should go to
stderr and only the available datasets/pools should go to
stdout. Returning nothing to stdout is expected behavior
when there is nothing to list.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #581

commit | commitdiff | tree

Darik Horn [Sat, 10 Mar 2012 02:56:17 +0000 (20:56 -0600)]

PPA 0.6.0.53-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Sat, 10 Mar 2012 02:52:38 +0000 (20:52 -0600)]

Merge branch 'upstream'

commit | commitdiff | tree

Brian Behlendorf [Mon, 5 Mar 2012 23:14:15 +0000 (15:14 -0800)]

Add sa_spill_rele() interface

Add a SA interface which allows us to release the spill block
from a SA handle without destroying the handle. This is useful
because we can then ensure that a copy of the dirty spill block
is not made at sync time due to the extra hold. Susequent calls
to sa_update() or sa_lookup() with transparently refetch the
spill block dbuf from the ARC hash.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Brian Behlendorf [Thu, 23 Feb 2012 23:32:51 +0000 (15:32 -0800)]

commit | commitdiff | tree

Richard Yao [Tue, 28 Feb 2012 21:23:16 +0000 (16:23 -0500)]

Fix distribution detection

Improve the distribution detection by moving the tests for
distribution specific files first. The Ubuntu and Debian
checks are left for last because they are the least likely
to be unique. This is particularly true in the case of Debian
since so many distributions are based on Debian.

Since this is currently only used to identify the correct
packaging method for this system the result in many instances
is simply cosmetic.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Ned Bass [Wed, 29 Feb 2012 18:08:20 +0000 (10:08 -0800)]

Align parition end on 1 MiB boundary

Some devices have exhibited sensitivity to the ending alignment of
partitions.  In particular, even if the first partition begins at 1
MiB, we have seen many sd driver task abort errors with certain SSDs
if the first partition doesn't end on a 1 MiB boundary.  This occurs
when the vdev label is read during pool creation or importation and
causes a delay of about 30 seconds per device.  It can also be
simulated with dd when the pool isn't imported:

  dd if=/dev/sda1 of=/dev/null bs=262144 count=1

For the record, this problem was observed with SMARTMOD
SG9XCA2E200GE01 200GB SSDs.  Unfortunately I don't have a good
explanation for this behavior. It seems to have something to do with
highly fragmented single-sector requests being issued to the device,
which it may not support.  With end-aligned partitions at least
page-sized requests were queued and issued to the driver according
to blktrace. In any case, aligning the partition end is a fairly
innocuous work-around, wasting at most 1 MiB of space.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #574

commit | commitdiff | tree

Darik Horn [Sat, 3 Mar 2012 14:34:41 +0000 (08:34 -0600)]

Remove the DISCLAIMER file from the docs list.

The upstream disclaimer was copied into the debian/copyright file
according to DEP-5, which makes the DISCLAIMER file redundant.

commit | commitdiff | tree

Darik Horn [Sat, 3 Mar 2012 14:28:37 +0000 (08:28 -0600)]

Use consistent branding in package descriptions.

Change all instances of "OpenSolaris" to "Solaris" in the
debian/control file.

commit | commitdiff | tree

Brian Behlendorf [Fri, 2 Mar 2012 18:35:50 +0000 (10:35 -0800)]

Use SA_HDL_PRIVATE for SA xattrs

A private SA handle must be used to ensure we can drop the dbuf
hold on the spill block prior to calling dmu_tx_commit().  If we
call dmu_tx_commit() before sa_handle_destroy(), then our hold
will trigger a copy of the dbuf to be made.  This is done to
prevent data from leaking in to the syncing txg.  As a result
the original dirty spill block will remain cached.

Additionally, relying on the shared zp->z_sa_hdl is unsafe in
the xattr context because the znode may be asynchronously dropped
from the cache.  It's far safer and simpler just to use a private
handle for xattrs.  Plus any additional overhead is offset by
the avoidance of the previously mentioned memory copy.

These forever dirty buffers can be noticed in the arcstats under
the anon_size.  On a quiescent system the value should be zero.
Without this fix and a SA xattr write workload you will see
anon_size increase.  Eventually, if enough dirty data builds up
your system it will appear to hang.  This occurs because the dmu
won't allow new txs to be assigned until that dirty data is
flushed, and it won't be because it's not part of an assigned tx.

As an aside, I typically see anon_size lurk around 16k so I think
there is another place in the code which needs a similar fix.
However, this value doesn't grow over time so it isn't critical.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #503
Issue #513

commit | commitdiff | tree

Darik Horn [Fri, 2 Mar 2012 19:12:52 +0000 (13:12 -0600)]

Remove the /etc/default/zfsload configuration file

The zfsload file is a relic of the ZoL 0.5 release series that was
deprecated in the ZoL 0.6 packaging. Delete zfsload if it is still
registered as a configuration file.

commit | commitdiff | tree

Darik Horn [Fri, 2 Mar 2012 15:23:41 +0000 (09:23 -0600)]

Split /etc/init.d/zfs into zfs-mount and zfs-share

ZFS depends on two distinct events during Linux system start:

* Local storage for `zfs mount -a`.
(eg: init $local_fs or upstart local-filesystems)

* NFS or CIFS services for `zfs share -a`.
(eg: nfsd or smbd events if they happen; otherwise init $network.)

Handling both events in a single `/etc/init.d/zfs` script is
unreliable, so split it into separate `/etc/init.d/zfs-mount`
and `/etc/init.d/zfs-share` scripts.

Also set sensible sysv sequence numbers for Debian and Ubuntu systems.

commit | commitdiff | tree

Darik Horn [Tue, 28 Feb 2012 19:30:50 +0000 (13:30 -0600)]

Add machine readable debian/copyright file.

Update the copyright file for DEP-5 policy conformance:

http://www.debian.org/doc/packaging-manuals/copyright-format/1.0/

commit | commitdiff | tree

Darik Horn [Tue, 28 Feb 2012 01:36:21 +0000 (19:36 -0600)]

PPA 0.6.0.52-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Tue, 28 Feb 2012 01:35:05 +0000 (19:35 -0600)]

Refresh debian/patches after upstream merge.

commit | commitdiff | tree

Darik Horn [Tue, 28 Feb 2012 01:33:06 +0000 (19:33 -0600)]

Merge branch 'upstream'

commit | commitdiff | tree

Brian Behlendorf [Mon, 27 Feb 2012 18:12:40 +0000 (10:12 -0800)]

Cleanly support debug packages

Allow a source rpm to be rebuilt with debugging enabled.  This
avoids the need to have to manually modify the spec file.  By
default debugging is still largely disabled.  To enable specific
debugging features use the following options with rpmbuild.

  '--with debug'               - Enables ASSERTs

  # For example:
  $ rpmbuild --rebuild --with debug zfs-modules-0.6.0-rc6.src.rpm

Additionally, ZFS_CONFIG has been added to zfs_config.h for
packages which build against these headers.  This is critical
to ensure both zfs and the dependant package are using the same
prototype and structure definitions.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Brian Behlendorf [Fri, 20 Jan 2012 18:58:57 +0000 (10:58 -0800)]

Add 'dmu_tx' kstats entry

Keep counters for the various reasons that a thread may end up
in txg_wait_open() waiting on a new txg. This can be useful
when attempting to determine why a particular workload is
under performing.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Brian Behlendorf [Mon, 30 Jan 2012 21:28:40 +0000 (13:28 -0800)]

Add arc_state_t stats to arcstats

To ensure the arc is behaving properly we need greater visibility
in to exactly how it's managing the systems memory. This patch
takes one step in that direction be adding the current arc_state_t
for the anon, mru, mru_ghost, mfu, and mfs_ghost lists. The l2
arc_state_t is already well represented in the arcstats.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Ned Bass [Sat, 25 Feb 2012 00:37:41 +0000 (16:37 -0800)]

Return success from check_slice() if device doesn't exist

When creating a new pool, make_root_vdev() calls check_in_use() to
ensure that none of the consituent disks are in use.  If the disk
contains a valid vdev label it is read to retrieve the list of its
child vdevs and these are checked recursively.  However, the
partitions stored in the vdev label my no longer exist, for example
if the partition table has since been altered.  In any such case we
would want the pool creation to proceed, so this change removes the
check from check_slice() that returns an error if the device doesn't
exist.  As an added assurance, the Solaris implementation also
returns sucess on ENOENT.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Darik Horn [Sat, 25 Feb 2012 18:10:04 +0000 (12:10 -0600)]

Change POST_INSTALL to POST_BUILD in dkms.conf

Install zfs_config.h and Module.symvers earlier so that they can be
used by consumers before the zfs module is activated by dkms.

See also dajhorn/pkg-spl@a68ebb5f24d073aca5f336b2681ee0a64c1a83d8.

commit | commitdiff | tree

Alex Zhuravlev [Fri, 17 Feb 2012 20:09:21 +0000 (12:09 -0800)]

Export symbols for zero-copy

Export additional symbols to make use of the DMU's zero-copy
API. This allows external modules to move data in to and out of
the ARC without incurring the cost of a memory copy.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Richard Yao [Mon, 13 Feb 2012 15:38:58 +0000 (10:38 -0500)]

Support ashift=13 for 8KB SSD block sizes

New SSDs are now available which use an internal 8k block size.
To make sure ZFS can get the maximum performance out of these
devices we're increasing the maximum ashift to 13 (8KB).

This value is still small enough that we can fit 16 uberblocks
in the vdev ring label. However, I don't want to increase this
any futher or it will limit the ability the safely roll back a
pool to recover it.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #565

commit | commitdiff | tree

Turbo Fredriksson [Mon, 13 Feb 2012 09:27:01 +0000 (10:27 +0100)]

Add 'fsid' mount option to allowed options.

Resolves nfs-utils-1.0.x compatibility issue which requires
that the fsid be set in the export options.

exportfs: Warning: /tank/dir requires fsid= for NFS export

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #570

commit | commitdiff | tree

Darik Horn [Mon, 13 Feb 2012 01:30:21 +0000 (19:30 -0600)]

PPA 0.6.0.51-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Mon, 13 Feb 2012 01:27:35 +0000 (19:27 -0600)]

Refresh debian/patches after upstream merge.

commit | commitdiff | tree

Darik Horn [Mon, 13 Feb 2012 01:18:01 +0000 (19:18 -0600)]

Merge branch 'upstream'

commit | commitdiff | tree

Brian Behlendorf [Fri, 10 Feb 2012 19:53:09 +0000 (11:53 -0800)]

Export symbols for zero-copy

Exported the required symbols to make use of the DMU's zero-copy
API. This allows external modules to move data in to and out of
the ARC without incurring the cost of a memory copy.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Brian Behlendorf [Fri, 10 Feb 2012 00:33:55 +0000 (16:33 -0800)]

Use spl_debug_* helpers

When configuring the spl debug log support use the provided wrapper
functions. This ensures that if --disable-debug-log was used when
buiding the spl the functions will have no effect.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Etienne Dechamps [Fri, 2 Sep 2011 13:23:12 +0000 (15:23 +0200)]

Add support for DISCARD to ZVOLs.

DISCARD (REQ_DISCARD, BLKDISCARD) is useful for thin provisioning.
It allows ZVOL clients to discard (unmap, trim) block ranges from
a ZVOL, thus optimizing disk space usage by allowing a ZVOL to
shrink instead of just grow.

We can't use zfs_space() or zfs_freesp() here, since these functions
only work on regular files, not volumes. Fortunately we can use the
low-level function dmu_free_long_range() which does exactly what we
want.

Currently the discard operation is not added to the log. That's not
a big deal since losing discard requests cannot result in data
corruption. It would however result in disk space usage higher than
it should be. Thus adding log support to zvol_discard() is probably
a good idea for a future improvement.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Etienne Dechamps [Fri, 2 Sep 2011 07:42:07 +0000 (09:42 +0200)]

Support the fallocate() file operation.

Currently only the (FALLOC_FL_PUNCH_HOLE) flag combination is
supported, since it's the only one that matches the behavior of
zfs_space(). This makes it pretty much useless in its current
form, but it's a start.

To support other flag combinations we would need to modify
zfs_space() to make it more flexible, or emulate the desired
functionality in zpl_fallocate().

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #334

commit | commitdiff | tree

Etienne Dechamps [Fri, 2 Sep 2011 07:37:53 +0000 (09:37 +0200)]

Check permissions in zfs_space().

This isn't done on Solaris because on this OS zfs_space() can
only be called with an opened file handle. Since the addition of
zpl_truncate_range() this isn't the case anymore, so we need to
enforce access rights.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #334

commit | commitdiff | tree

Etienne Dechamps [Thu, 1 Sep 2011 11:59:41 +0000 (13:59 +0200)]

Implement the truncate_range() inode operation.

This operation allows "hole punching" in ZFS files. On Solaris this
is done via the vop_space() system call, which maps to the zfs_space()
function. So we just need to write zpl_truncate_range() as a wrapper
around zfs_space().

Note that this only works for regular files, not ZVOLs.

This is currently an insecure implementation without permission
checking, although this isn't that big of a deal since truncate_range()
isn't even callable from userspace.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #334

commit | commitdiff | tree

Brian Behlendorf [Thu, 9 Feb 2012 18:38:03 +0000 (10:38 -0800)]

Fix zconfig.sh non-optimal alignment

The recent zvol improvements have changed default suggested alignment
for zvols from 512b (default) to 8k (zvol blocksize).  Because of this
the zconfig.sh tests which create paritions are now generating a
warning about non-optimal alignments.

This change updates the need zconfig.sh tests such that a partition
will be properly aligned.  In the process, it shifts from using the
sfdisk utility to the parted utility to create partitions.  It also
moves the creation of labels, partitions, and filesystems in to
generic functions in common.sh.in.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Darik Horn [Wed, 8 Feb 2012 23:57:38 +0000 (17:57 -0600)]

PPA 0.6.0.50-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Wed, 8 Feb 2012 23:55:26 +0000 (17:55 -0600)]

Refresh debian/patches after upstream merge.

commit | commitdiff | tree

Darik Horn [Wed, 8 Feb 2012 23:52:00 +0000 (17:52 -0600)]

Merge branch 'upstream'

commit | commitdiff | tree

Etienne Dechamps [Wed, 8 Feb 2012 21:41:41 +0000 (22:41 +0100)]

Use 32 as the default number of zvol threads.

Currently, the `zvol_threads` variable, which controls the number of worker
threads which process items from the ZVOL queues, is set to the number of
available CPUs.

This choice seems to be based on the assumption that ZVOL threads are
CPU-bound. This is not necessarily true, especially for synchronous writes.
Consider the situation described in the comments for `zil_commit()`, which is
called inside `zvol_write()` for synchronous writes:

> itxs are committed in batches. In a heavily stressed zil there will be a
> commit writer thread who is writing out a bunch of itxs to the log for a
> set of committing threads (cthreads) in the same batch as the writer.
> Those cthreads are all waiting on the same cv for that batch.
>
> There will also be a different and growing batch of threads that are
> waiting to commit (qthreads). When the committing batch completes a
> transition occurs such that the cthreads exit and the qthreads become
> cthreads. One of the new cthreads becomes he writer thread for the batch.
> Any new threads arriving become new qthreads.

We can easily deduce that, in the case of ZVOLs, there can be a maximum of
`zvol_threads` cthreads and qthreads. The default value for `zvol_threads` is
typically between 1 and 8, which is way too low in this case. This means
there will be a lot of small commits to the ZIL, which is very inefficient
compared to a few big commits, especially since we have to wait for the data
to be on stable storage. Increasing the number of threads will increase the
amount of data waiting to be commited and thus the size of the individual
commits.

On my system, in the context of VM disk image storage (lots of small
synchronous writes), increasing `zvol_threads` from 8 to 32 results in a 50%
increase in sequential synchronous write performance.

We should choose a more sensible default for `zvol_threads`. Unfortunately
the optimal value is difficult to determine automatically, since it depends
on the synchronous write latency of the underlying storage devices. In any
case, a hardcoded value of 32 would probably be better than the current
situation. Having a lot of ZVOL threads doesn't seem to have any real
downside anyway.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Fixes #392

commit | commitdiff | tree

Etienne Dechamps [Mon, 5 Sep 2011 13:15:45 +0000 (15:15 +0200)]

Improve ZVOL queue behavior.

The Linux block device queue subsystem exposes a number of configurable
settings described in Linux block/blk-settings.c. The defaults for these
settings are tuned for hard drives, and are not optimized for ZVOLs. Proper
configuration of these options would allow upper layers (I/O scheduler) to
take better decisions about write merging and ordering.

Detailed rationale:

- max_hw_sectors is set to unlimited (UINT_MAX). zvol_write() is able to
   handle writes of any size, so there's no reason to impose a limit. Let the
   upper layer decide.

- max_segments and max_segment_size are set to unlimited. zvol_write() will
   copy the requests' contents into a dbuf anyway, so the number and size of
   the segments are irrelevant. Let the upper layer decide.

- physical_block_size and io_opt are set to the ZVOL's block size. This
   has the potential to somewhat alleviate issue #361 for ZVOLs, by warning
   the upper layers that writes smaller than the volume's block size will be
   slow.

- The NONROT flag is set to indicate this isn't a rotational device.
   Although the backing zpool might be composed of rotational devices, the
   resulting ZVOL often doesn't exhibit the same behavior due to the COW
   mechanisms used by ZFS. Setting this flag will prevent upper layers from
   making useless decisions (such as reordering writes) based on incorrect
   assumptions about the behavior of the ZVOL.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Etienne Dechamps [Mon, 5 Sep 2011 09:11:38 +0000 (11:11 +0200)]

Fix synchronicity for ZVOLs.

zvol_write() assumes that the write request must be written to stable storage
if rq_is_sync() is true. Unfortunately, this assumption is incorrect. Indeed,
"sync" does *not* mean what we think it means in the context of the Linux
block layer. This is well explained in linux/fs.h:

    WRITE:       A normal async write. Device will be plugged.
    WRITE_SYNC:  Synchronous write. Identical to WRITE, but passes down
                 the hint that someone will be waiting on this IO
                 shortly.
    WRITE_FLUSH: Like WRITE_SYNC but with preceding cache flush.
    WRITE_FUA:   Like WRITE_SYNC but data is guaranteed to be on
                 non-volatile media on completion.

In other words, SYNC does not *mean* that the write must be on stable storage
on completion. It just means that someone is waiting on us to complete the
write request. Thus triggering a ZIL commit for each SYNC write request on a
ZVOL is unnecessary and harmful for performance. To make matters worse, ZVOL
users have no way to express that they actually want data to be written to
stable storage, which means the ZIL is broken for ZVOLs.

The request for stable storage is expressed by the FUA flag, so we must
commit the ZIL after the write if the FUA flag is set. In addition, we must
commit the ZIL before the write if the FLUSH flag is set.

Also, we must inform the block layer that we actually support FLUSH and FUA.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>

commit | commitdiff | tree

Etienne Dechamps [Fri, 2 Sep 2011 07:57:35 +0000 (09:57 +0200)]

Support "sync=always" for ZVOLs.

Currently the "sync=always" property works for regular ZFS datasets, but not
for ZVOLs. This patch remedies that.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Fixes #374.

commit | commitdiff | tree

Darik Horn [Sun, 5 Feb 2012 02:54:43 +0000 (20:54 -0600)]

Let libnvpair be linked independently of libzfs.

Autoconf will fail to detect the ZoL libnvpair on systems that do not
implicitly link library runtime dependencies, which is anything that
has the GCC 4.5 DCO update.

Build libuutil before libnvpair, and put it on the the LDADD line of
the libnvpair automake template.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes: #560

commit | commitdiff | tree

Darik Horn [Fri, 3 Feb 2012 21:42:18 +0000 (15:42 -0600)]

PPA 0.6.0.49-0ubuntu1 release.

commit | commitdiff | tree

Darik Horn [Fri, 3 Feb 2012 21:39:31 +0000 (15:39 -0600)]

Add patch: Improve the --with-spl error

If the SPL module is unavailable, then the operator gets an error
message that does not apply to installations that are mananged by
DKMS. Change the error message to fit apt-get systems.

A better solution would be to implement a dependency model in DKMS,
or to bundle SPL into the ZFS package.

commit | commitdiff | tree

Darik Horn [Fri, 3 Feb 2012 21:37:53 +0000 (15:37 -0600)]

Refresh patches after upstream merge.

commit | commitdiff | tree

Darik Horn [Fri, 3 Feb 2012 20:57:14 +0000 (14:57 -0600)]

Merge branch 'upstream'

commit | commitdiff | tree

Darik Horn [Fri, 3 Feb 2012 19:33:58 +0000 (13:33 -0600)]

Revert "Manually sync scripts/ with the upstream release."

This reverts commit ae1cd09777a9fb4060fabf830df4c32824951f24
to keep pkg-zfs limited to the debian/ overlay.

These symlinks were dereferenced in pkg-zfs so that the zfs-linux
source package could be diffed directly onto the vanilla upstream
tarball, but the orig tarball produced by git-buildpackage from the
upstream/* git tags is now being published to the PPA instead.

Mirror of ZFSOnLinux for Debian

RSS Atom