finish whatsnew for 8.5
This commit is contained in:
parent
0a5375b6cd
commit
18586bec8a
241
whatsnew-8.5.md
241
whatsnew-8.5.md
@ -3,132 +3,181 @@ main features.
|
||||
|
||||
## New operators
|
||||
|
||||
thumbnail / thumbnail_buffer
|
||||
Almost all of the logic from the `vipsthumbnail` program is now in a pair of
|
||||
new operators, `vips_thumbnail()` and `vips_thumbnail_buffer()`. These are very
|
||||
handy for the various scripting languages with vips bindings: you can now make
|
||||
a high-quality, high-speed thumbnail in PHP (for example) with just:
|
||||
|
||||
```php
|
||||
$filename = ...;
|
||||
$image = Vips\Image::thumbnail($filename, 200, ["height" => 200]);
|
||||
$image.writeToFile("my-thumbnail.jpg");
|
||||
```
|
||||
|
||||
How seq used to work:
|
||||
The new thumbnail operator has also picked up some useful features:
|
||||
|
||||
* The vips sink functions create a set of N threads and scan images
|
||||
top-to-bottom in tiles, allocating tiles to workers as they finish.
|
||||
* **Smart crop** A new cropping mode called `attention` searches the image for
|
||||
edges, skin tones and areas of saturated colour, and attempts to position the
|
||||
crop box over the most significant feature.
|
||||
|
||||
* They have extra logic to keep workers together. They track the position
|
||||
of the most-delayed worker and if the lead thread gets more than M scanlines
|
||||
ahead, it stalls until the stragglers catch up.
|
||||
* **Crop constraints** Thanks to tomasc, libvips has crop constraints. You
|
||||
can set it to only thumbnail if the image is larger or smaller than the target
|
||||
(the `<` and `>` modifiers in imagemagick), and to crop to a width or height.
|
||||
|
||||
* There is more logic in the loaders: they keep track of the current
|
||||
Y position, and if the lead request thread gets ahead of the current
|
||||
read point, it stalls with a 60s timeout until the intermediate tiles are
|
||||
requested. This logic is implemented in the `vips_sequential()` operation.
|
||||
* **Buffer sources** `vips_thumbnail_buffer()` will thumbnail an image held as
|
||||
a formatted block of data in memory. This is useful for cloud services, where
|
||||
the filesystem is often rather slow.
|
||||
|
||||
The point of trying to keep thread locality and ordering is that we want
|
||||
to limit the number of scanlines that loaders have to keep behind the read
|
||||
point. We want to stream images through memory, not be forced into a load /
|
||||
process / save model.
|
||||
CLAHE, or Contrast-Limited Adaptive Histogram Equalisation, is a simple way to
|
||||
make local histogram equalisation more useful.
|
||||
|
||||
This works reasonably well for simple cases, like thumbnailing single images,
|
||||
but can fail in more complex cases, such as repeated `vips_insert()`. Consider
|
||||
this Python program:
|
||||
Plain local equalization removes
|
||||
all global brightness variation and can make images hard to understand.
|
||||
The `hist_local` operator now has a `max-slope` parameter you can use to limit
|
||||
how much equalisation can alter your image. A value of 3 generally works well.
|
||||
|
||||
## Toilet roll images
|
||||
|
||||
libvips used to let you pick single pages out of multi-page images, such
|
||||
as PDFs, but had little support for processing entire documents.
|
||||
|
||||
libvips 8.5 now has good support for toilet roll images. You can load a
|
||||
multipage image as a very tall, thin strip, process the whole thing, and write
|
||||
back to another multi-page file. The extra feature is an `n` parameter which
|
||||
gives the number of pages to load, or -1 to load all pages.
|
||||
|
||||
For example, (OME-
|
||||
TIFF)[https://www.openmicroscopy.org/site/support/ome-model/ome-tiff]
|
||||
is a standard for microscopy data that stores volumetric images as multi-page
|
||||
TIFFs. They have some (sample
|
||||
data)[https://www.openmicroscopy.org/site/support/ome-model/ome-tiff/data.html]
|
||||
including a 4D image of an embryo.
|
||||
|
||||
Each TIFF contains 10 slices. Normally you just see page 0:
|
||||
|
||||
```
|
||||
$ vipsheader tubhiswt_C0_TP13.ome.tif
|
||||
tubhiswt_C0_TP13.ome.tif: 512x512 uchar, 1 band, b-w, tiffload
|
||||
```
|
||||
|
||||
Use `n=-1` and you see all the pages as a very tall strip:
|
||||
|
||||
```
|
||||
$ vipsheader tubhiswt_C0_TP13.ome.tif[n=-1]
|
||||
tubhiswt_C0_TP13.ome.tif: 512x5120 uchar, 1 band, b-w, tiffload
|
||||
```
|
||||
|
||||
You can work with PDF, TIFF, GIF and all imagemagick-supported formats in
|
||||
this way.
|
||||
|
||||
You can write this tall strip to another file, and it will be broken up into
|
||||
pages:
|
||||
|
||||
```
|
||||
$ vips copy tubhiswt_C0_TP13.ome.tif[n=-1] x.tif
|
||||
$ vipsheader x.tif
|
||||
x.tif: 512x512 uchar, 1 band, b-w, tiffload
|
||||
$ vipsheader x.tif[n=-1]
|
||||
x.tif: 512x5120 uchar, 1 band, b-w, tiffload
|
||||
```
|
||||
|
||||
The extra magic is a `page-height` property that images carry around that says
|
||||
how long each sheet of toilet paper is.
|
||||
|
||||
There are clearly some restrictions with this style of multi-page document
|
||||
handling: all pages must have identical width, height and colour depth; and image
|
||||
processing operators have no idea they are dealing with a multi-page document,
|
||||
so if you do something like `resize`, you'll need to update `page-height`.
|
||||
You'll also need to be careful about edge effects if you're using spatial
|
||||
filters.
|
||||
|
||||
## Computation reordering
|
||||
|
||||
Thanks to the developer of
|
||||
(PhotoFlow)[https://github.com/aferrero2707/PhotoFlow], a non-destructive image
|
||||
editor with a libvips backend, libvips can now reorder computations to reduce
|
||||
recalculation. This can (sometimes) produce a dramatic speedup.
|
||||
|
||||
This has been (discussed on the libvips
|
||||
blog)[http://libvips.blogspot.co.uk/2017/01/automatic-computation-reordering.html],
|
||||
but briefly, the order in which operator arguments are evaluated can have a
|
||||
big effect on runtime due to the way libvips tries to cache and reuse results
|
||||
behind the scenes.
|
||||
|
||||
The blog post has some examples and some graphs.
|
||||
|
||||
## New sequential mode
|
||||
|
||||
libvips sequential mode has been around for a while. This is the thing libvips
|
||||
uses to stream pixels through your computer, from input file to output file,
|
||||
without having to have the whole image in memory all at the same time. When it
|
||||
works, it give a nice performance boost and a large drop in memory use.
|
||||
|
||||
There are some more complex cases where it didn't work. Consider this Python
|
||||
program:
|
||||
|
||||
```python
|
||||
#!/usr/bin/python
|
||||
|
||||
import sys import random
|
||||
import sys
|
||||
import random
|
||||
|
||||
import gi gi.require_version('Vips', '8.0') from gi.repository import Vips
|
||||
import gi
|
||||
gi.require_version('Vips', '8.0')
|
||||
from gi.repository import Vips
|
||||
|
||||
composite = Vips.Image.black(100000, 100000)
|
||||
composite = Vips.Image.black(10000, 10000)
|
||||
|
||||
for filename in sys.argv[2:]:
|
||||
tile = Vips.Image.new_from_file(filename, access = Vips.Access.SEQUENTIAL)
|
||||
x = random.randint(0, composite.width - tile.width) y = random.randint(0,
|
||||
composite.height - tile.height) composite = composite.insert(tile, x, y)
|
||||
x = random.randint(0, composite.width - tile.width)
|
||||
y = random.randint(0, composite.height - tile.height)
|
||||
composite = composite.insert(tile, x, y)
|
||||
|
||||
composite.write_to_file(sys.argv[1])
|
||||
```
|
||||
|
||||
This makes a 100,000 x 100,000 pixel black image, then inserts a lot of
|
||||
other files into it and writes the result.
|
||||
It makes a large 10,000 x 10,000 pixel image, then inserts all of the images
|
||||
you list at random positions, then writes the result.
|
||||
|
||||
With vips8.4, this could very easily fail. Imagine this situation:
|
||||
You'd think this could work with sequential mode, but sadly with earlier
|
||||
libvipses it will sometimes fail. The problem is that images can cover each
|
||||
other, so while writing, libvips can discover that it only needs the bottom few
|
||||
pixels of one of the input images. The image loaders used to track the current
|
||||
read position, and if a request came in for some pixels way down the image,
|
||||
they'd assume one of the evaluation threads had run ahead of the rest and
|
||||
needed to be stalled. Once stalled, it was only restarted on a long timeout,
|
||||
causing performance to drop through the floor.
|
||||
|
||||
* image1 is very tall and thin
|
||||
libvips 8.5 has a new implementation of sequential mode that changes the way
|
||||
threads are kept together as images are processed. Rather than trying to add
|
||||
constraints to load operations, instead it puts the constraints into operations
|
||||
that can cause threads to become spread out, such as vertical shrink.
|
||||
|
||||
* image2 is short and fat, and by chance covers the centre of image1
|
||||
As a result of this change, many more things can run in sequential mode, and
|
||||
out of order reads should be impossible.
|
||||
|
||||
* we'll write the top part of image1, then write the body of image2
|
||||
## `libxml2` swapped out for `expat`
|
||||
|
||||
* after image2 has been written, we need to write the bottom of image1,
|
||||
so a thread will ask for a set of pixels near the end of image1
|
||||
libvips has used libxml2 as its XML parser since dinosaurs roamed the Earth.
|
||||
Now libvips is based on gobject, the XML parser selected by glib, expat, makes
|
||||
more sense, since it will already be linked.
|
||||
|
||||
* image1 knows that the previous request was for some scanlines near the top,
|
||||
so it thinks this request must be from a thread that has run way ahead of
|
||||
the pack and stalls it
|
||||
|
||||
And we have a deadlock. In fact, vips wouldn't deadlock, it would just
|
||||
pause on a 60s timeout on each thread. Sad!
|
||||
|
||||
Here's how the new seq works:
|
||||
|
||||
* Sinks work as before.
|
||||
|
||||
* Loaders use a revised `vips_sequential()` with the stalling logic
|
||||
removed. All it does now is track the read position, cache a few 100 lines
|
||||
behind the read point, and makes sure that lines are evaluated in order
|
||||
with no missing areas.
|
||||
|
||||
* Operations like `vips_shrinkv()` which can cause large non-local references
|
||||
have an extra bit of code which, if the input comes from a sequential source,
|
||||
adds a an extra `vips_sequential()` operator on the output. This forces
|
||||
`vips_shrinkv()` input to be sequential.
|
||||
|
||||
The old one constrained thread location on output, and on input as well. The
|
||||
new idea is to get rid of input constraints, and instead add extra code
|
||||
to the operations which could trigger large non-local references. Rather
|
||||
than tying threads down to stop them drifting apart, it makes sure they
|
||||
can never get too far apart in the first place.
|
||||
|
||||
Running the test program with git master gives this result:
|
||||
|
||||
```
|
||||
real 1m2.317s
|
||||
user 2m58.472s
|
||||
sys 0m7.568s
|
||||
peak mem: 10gb
|
||||
```
|
||||
|
||||
Not bad!
|
||||
|
||||
## Reordering
|
||||
|
||||
from blog
|
||||
|
||||
## Image resize
|
||||
|
||||
smartcrop, more flexible cropping
|
||||
It's nice to be able to remove a required dependency for a change.
|
||||
|
||||
## File format support
|
||||
|
||||
As usual, there are a lot of improvements to file format read and write.
|
||||
As usual, there are a range of improvements to file format read and write.
|
||||
|
||||
* Thanks to work by Felix Bünemann, `webp` read and write supports many more
|
||||
options.
|
||||
* andris has improved `pdfload` so you can load many pages in a single
|
||||
operation.
|
||||
* Many people have worked on `dzsave` Google mode. It's now better at
|
||||
skipping blank tiles and supports tile overlaps. Felix Bünemann added
|
||||
support for compressed zip output.
|
||||
* Henri Chain has added `radsave_buffer` to improve Radiance support.
|
||||
* TIFF files with an orientation tag should now autorotate, `tiffsave`
|
||||
has better jpeg compression support, and it knows about the `strip`
|
||||
metadata option.
|
||||
* The load-via-libMagick operator now supports IM7.
|
||||
* The GIF loader is much smarter about guessing the number of colour channels.
|
||||
* PNG save supports `strip`.
|
||||
* The SVG loader supports `svgz` compressed files thanks to Felix Bünemann.
|
||||
* Thanks to a push from Felix Bünemann, TIFF now supports load and save to and
|
||||
from memory buffers.
|
||||
* `dzsave` can write to memory (as a zip file) as well.
|
||||
* Again, thanks to pushing from Felix, libvips now supports ICC, XMP and IPCT
|
||||
metadata for WebP images.
|
||||
* FITS images support `bzero` and `bscale`.
|
||||
* `tiffload` memory use is now much lower for images with large strips.
|
||||
|
||||
## Other
|
||||
|
||||
Improvements to the build system, reductions in memory use, many small
|
||||
bug fixes, improvements to the C++ binding, improvements to the Python binding,
|
||||
many small performance fixes. As usual, the ChanegLog has more detail if
|
||||
you're interested.
|
||||
Many small bug fixes, improvements to the C++ binding.
|
||||
As usual, the ChanegLog has more detail if you're interested.
|
||||
|
Loading…
Reference in New Issue
Block a user