Boston Linux & Unix (BLU) Home | Calendar | Mail Lists | List Archives | Desktop SIG | Hardware Hacking SIG
Wiki | Flickr | PicasaWeb | Video | Maps & Directions | Installfests | Keysignings
Linux Cafe | Meeting Notes | Blog | Linux Links | Bling | About BLU

BLU Discuss list archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Help with finding duplicate photos



 On Thu, Feb 21, 2008 at 08:26:01PM -0500, Tom Haskins-Vaughan wrote: 
> Hi, here's the scenario. I have a directory, /home/photos and in that 
> folder are lots and lots of photos in many different subfolders. Now, 
> since I'm paying by the gig to back these up remotely I'd like to make 
> sure that there are no duplicates. If i can get a list of duplicate 
> filenames, I'm happy to go through and check that they're actually 
> duplicates manually, but I'm not too good on the command line. 

Two of my favorite tools for cleaning up after merging a dozen 
different home directories are FSlint and GQview. 

http://www.pixelbeat.org/fslint/
"FSlint is a utility to find and clean various forms of lint on a 
filesystem.  For example, one form of lint it finds is duplicate 
files.  It has both GUI and command line modes." 

http://gqview.sourceforge.net/
"an image browser that features single click access to view images and 
move around the directory tree 
 Features: Find duplicates based on file attributes or image content." 

GQview's duplicate finder is really nice.  Right-click on a directory 
name, select "Find duplicates recursive", set "Compare by" to 
"Similarity", and check the "Thumbnails" box.  Depending on how many 
images you have it might take a long time[1].  The end result is a 
list of similar images, where you might have both an original and 
retouched version.  You can also compare by checksum, which will find 
exact matches regardless of file name or location.  From the list you 
can select the copies you don't want to keep and delete them. 


[1] Scanning 4056 files, most of which are 7.1MP jpeg, takes about 20 
minutes on my new(ish) T61 laptop. 

-- 
Matt Brodeur                                                    RHCDS 
[hidden email]                         http://www.nexttime.com
PGP ID: 2CFE18A3 / 9EBA 7F1E 42D1 7A43 5884  560C 73CF D615 2CFE 18A3 
It is a miracle that curiosity survives formal education. 
                -- Albert Einstein 
_______________________________________________ 
Discuss mailing list 
[hidden email] 
http://lists.blu.org/mailman/listinfo/discuss
 


BLU is a member of BostonUserGroups
BLU is a member of BostonUserGroups
We also thank MIT for the use of their facilities.

Valid HTML 4.01! Valid CSS!



Boston Linux & Unix / webmaster@blu.org