File Deduplication (Docker)
Archived

File Deduplication (Docker)

A deduplication image that scans and removes duplicate files in directories.

GoDockerSHA512MD5
GitHub Project

This is a simple project designed to help manage and clean up directories by identifying and removing duplicate files and empty directories. It scans specified root directories, finds duplicate files based on their content, logs file details, and optionally deletes them based on filenames and dates.

This project uses Go libraries like logrus for logging and caarlos0/env for handling environment variables. It first finds all files with the same sizes, then relies on MD5 and SHA-512 hashing to spot duplicate files accurately.

The project is designed to run in a container with configurable environment variables:

docker-compose.yml
services:
  dedup:
    image: meinya/dedup:latest
    volumes:
        - /to_dedup:/dedup
    environment:
        - DO_REMOVE=false
        - MIN_SIZE=100000000
        - EXCLUDE_EXT=.3fr

Tags:

Go,

Docker,

SHA512,

MD5

Previous
Trading System
Next
Recipe Community