
Re: [LUG] filtering recovered jpeg files

 

Listing them in a GUI file manager by size and dragging the big ones off to one place and the small ones to another is too easy?

If you have keywords in the EXIF data then you could read those with a script, I suppose.
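(A sketch of that, assuming the exiftool utility is installed and the keywords live in the standard Keywords tag:)

   # print the keywords for each jpeg; the -if condition skips
   # files that don't have the tag at all
   exiftool -if '$Keywords' -p '$FileName: $Keywords' *.jpg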

On Tue, 21 Aug 2018 at 12:33 Gordon Henderson <gordon+lug@xxxxxxxxxx> wrote:
On Mon, 20 Aug 2018, Pentiddy wrote:

> Hello all...
> A few years back I made the novice mistake of losing all my photos due to
> a disk crash...
> I did, however, manage to run a recovery program on the disk and get
> images off it.
> I am wanting to finally sort through these and delete spurious thumbnails,
> internet-cached images etc., and wondered if there is a convenient way of,
> for instance, filtering just the full-sized pictures out of the folders,
> or deleting files below a certain size... I am Xfce based, so Thunar
> scripts are an option...
> Any help with this is greatly appreciated - trying to sort out some
> memorable photos for my daughter leaving home.

How good is your shell-fu?

You can use the jpeginfo command to get the dimensions and byte size of each file:

   jpeginfo *.jpg > /tmp/foo

then edit /tmp/foo and look for the ones that are not thumbnails.
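(jpeginfo doesn't descend into subdirectories by itself, so if the recovered
files are spread across folders, something like this - a sketch - feeds them
all in:)

   # collect every jpeg under the current directory, however deep;
   # -print0/-0 keeps filenames with spaces intact
   find . -iname '*.jpg' -print0 | xargs -0 jpeginfo > /tmp/foo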

A quick example output:

   IMG_20180116_193807.jpg 4160 x 3120 24bit Exif  N 3832530
   j.jpg  495 x 811  24bit Exif  N  104293
   loaf.jpg 2000 x 2694 24bit JFIF  P  753672

So the first is a camera image; the 2nd is an odd size, but probably
something scaled for the web; the last is another (probably) processed
image. Thumbnails might be 128 pixels wide or smaller, so you can manually
get rid of them by sifting through the file...
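If you just want to drop everything below a certain byte size first (as the
original question suggests), find can do that directly - a sketch, using an
assumed 50k cutoff:

   # list every jpeg under 50 kB; once happy with the list,
   # swap -print for -delete to actually remove them
   find . -iname '*.jpg' -size -50k -print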

However something like:

   sort -k2 -nr /tmp/foo > /tmp/foo2

will sort the file numerically in reverse (largest first) on the 2nd field
(the X size) and write it to /tmp/foo2 - using the example above yields:

   IMG_20180116_193807.jpg 4160 x 3120 24bit Exif  N 3832530
   loaf.jpg 2000 x 2694 24bit JFIF  P  753672
   j.jpg  495 x 811  24bit Exif  N  104293

... you can then manually edit the file and cull the lines with an X size
smaller than whatever threshold you choose (this sorts largest at the top,
so go to the end of the file) - or edit further commands into the file,
turning it into a script by e.g. prefixing rm onto the lines for the files
you want to delete.
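One way to do that edit non-interactively (a sketch, assuming no spaces in
the filenames, since the fields are space-separated):

   # turn each "name width x height ..." line into "rm -- name"
   sed 's/^\([^ ]*\) .*/rm -- \1/' /tmp/foo2 > /tmp/cull.sh
   # delete the lines for files you want to KEEP, then: sh /tmp/cull.sh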

And so on.
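The byte size is the last field of the jpeginfo output, so sorting on that
instead is just a different key (a sketch, assuming the 8-field output
shape shown above):

   # field 8 is the file size in bytes in the example output
   sort -k8 -nr /tmp/foo > /tmp/foo2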

You can get more clever with extra tools like awk, which is better at
numeric comparisons (keeping or dropping lines where a field is <= some
number), but then you're into diminishing returns for a one-off task.
Faced with a few directories of a few thousand random files, this is how
I'd do it.
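(For completeness, the awk version of the cull - a sketch that prints rm
commands for anything narrower than an assumed 200-pixel cutoff; review the
output, then pipe it to sh when happy:)

   # $1 is the filename, $2 the X size in jpeginfo's output;
   # assumes no spaces in the filenames
   awk '$2 <= 200 { printf "rm -- \"%s\"\n", $1 }' /tmp/foo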

Gordon

--

A Midgley

Nowadays: Property and Photography

-- 
The Mailing List for the Devon & Cornwall LUG
https://mailman.dclug.org.uk/listinfo/list
FAQ: http://www.dcglug.org.uk/listfaq