GENEALOGIX

glx duplicates

Detect potential duplicate persons in a GLX archive

Synopsis

Scan a GLX archive for potential duplicate person records.

Compares all persons using a weighted scoring model based on:

Name similarity (Levenshtein distance, nickname matching, initials)
Birth/death year proximity
Birth/death place match
Shared relationships and events
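
A weighted scoring model of this kind can be sketched roughly as follows. This is only an illustration, not glx's actual implementation: the Person fields, the WEIGHTS values, and the 5-year proximity window are assumptions chosen for the example, and nickname matching, initials, death data, and shared relationships/events are left out for brevity.

```python
from dataclasses import dataclass
from typing import Optional


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def name_similarity(a: str, b: str) -> float:
    """Normalize edit distance to a 0.0-1.0 similarity (1.0 = identical)."""
    a, b = a.lower().strip(), b.lower().strip()
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    return 1.0 - levenshtein(a, b) / longest


def year_proximity(y1: Optional[int], y2: Optional[int], window: int = 5) -> float:
    """1.0 for equal years, falling linearly to 0.0 at `window` years apart.

    The window size is an assumption made for this sketch.
    """
    if y1 is None or y2 is None:
        return 0.0
    return max(0.0, 1.0 - abs(y1 - y2) / window)


@dataclass
class Person:
    # Hypothetical record shape, not the GLX schema.
    name: str
    birth_year: Optional[int] = None
    birth_place: Optional[str] = None


# Hypothetical weights; they sum to 1.0 so the score stays within 0.0-1.0.
WEIGHTS = {"name": 0.5, "birth_year": 0.3, "birth_place": 0.2}


def similarity(p: Person, q: Person) -> float:
    """Weighted sum of the per-field similarity signals."""
    same_place = bool(p.birth_place) and p.birth_place == q.birth_place
    return (WEIGHTS["name"] * name_similarity(p.name, q.name)
            + WEIGHTS["birth_year"] * year_proximity(p.birth_year, q.birth_year)
            + WEIGHTS["birth_place"] * (1.0 if same_place else 0.0))
```

Under these assumed weights, two records with near-identical names, birth years two years apart, and the same birth place score well above the default threshold, while records that agree on nothing but a vaguely similar name cannot exceed 0.5.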

Persons already linked by a direct relationship (parent-child, spouse, etc.) are skipped automatically, since they are known to be distinct individuals. Pairs that dated parent-role evidence shows cannot be the same person for generational reasons (e.g., a documented adult parent and a newborn at the same point in time) are also suppressed automatically.

Use --threshold to adjust sensitivity (0.0-1.0, default 0.60). Higher values yield fewer, higher-confidence matches.
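
The skip rule and the threshold can be pictured together with a short sketch. This is an assumption-laden illustration rather than the tool's code: candidate_pairs, find_duplicates, and the score callback are hypothetical names, and the generational-impossibility check is omitted.

```python
from itertools import combinations
from typing import Callable, Iterable, Iterator, List, Tuple


def candidate_pairs(person_ids: Iterable[str],
                    related_pairs: Iterable[Tuple[str, str]]) -> Iterator[Tuple[str, str]]:
    """Yield each unordered pair of persons not linked by a direct relationship."""
    linked = {frozenset(pair) for pair in related_pairs}
    for a, b in combinations(sorted(person_ids), 2):
        if frozenset((a, b)) not in linked:
            yield a, b


def find_duplicates(person_ids: Iterable[str],
                    related_pairs: Iterable[Tuple[str, str]],
                    score: Callable[[str, str], float],
                    threshold: float = 0.60) -> List[Tuple[str, str, float]]:
    """Keep pairs scoring at or above the threshold, best match first."""
    matches = []
    for a, b in candidate_pairs(person_ids, related_pairs):
        s = score(a, b)
        if s >= threshold:
            matches.append((a, b, s))
    return sorted(matches, key=lambda m: m[2], reverse=True)
```

Raising the threshold argument in this sketch shrinks the result list in the same way that a higher --threshold value is described as producing fewer, higher-confidence matches.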

glx duplicates [person] [flags]

Examples

  # Scan for duplicates in current directory
  glx duplicates

  # Scan with higher confidence threshold
  glx duplicates --threshold 0.8

  # Check a specific person for duplicates
  glx duplicates person-robert-webb

  # JSON output for tooling
  glx duplicates --json

  # Scan a specific archive
  glx duplicates --archive my-family-archive

Options

  -a, --archive string    Archive path (directory or single file) (default ".")
  -h, --help              help for duplicates
      --json              JSON output
      --threshold float   Minimum similarity score (0.0-1.0) (default 0.6)

Options inherited from parent commands

  -q, --quiet   Suppress non-error output (where supported)
