Coder Social home page Coder Social logo

findupe's Introduction

Find Duplicates

Go code to find duplicate files by using size+sha512, with option to also use md5 sums to deepen confidence in matches.

Installation

findupe is written in go-lang and is provided as source, so you'll need to have Google's "go" language tools available.

Golang Installation

Windows

	winget install golang

Debian

	sudo apt install golang
	# or
	snap install --classic go

Redhat/CentOS

	sudo yum install golang
	# or
	snap install --classic go

MacOS

	brew install golang

Installing Findupe

go get -u github.com/kfsone/findupe

You can then optionally either install it, which requires ~/go/bin to be in your path:

go install github.com/kfsone/findupe

Or you can run it from the command line with go run:

go run github.com/kfsone/findupe

Usage

-L, --list-collisions   List files for which matches were found.
-b, --min-bytes int     Minimum size (bytes) for file to consider. (default 256)
-p, --path string       Directory to recurse over. (default ".")
-T, --thorough          Append SHA sums with MD5 sums.
-j, --threads int       Number of concurrent workers. (default 9)

Examples

Search for duplicates under some path. with only a summary.

findupe --path some/sub/dir

Search '/tmp' for files over 1k in size, use both SHA 512 and MD5 sums to maximize confidence that duplicates are exact matches, use 32 concurrent workers, and list the actual files.

findupe -b 1024 --list-collisions -T -p /tmp

findupe's People

Contributors

kfsone avatar

Watchers

 avatar James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.