idgrep

Tool for finding identifiers in files. Supports recursive globbing and boolean arbitrary complex queries.

Examples

Find all files that contains cats, dogs or both.
```
$ idgrep -- 'cats | dogs'
```
Find all files that contains either
- fish without cats or dogs
- cats or dogs without fish
```
$ idgrep -- 'fish ^ (cats | dogs)'
```
List identifier count in all small (< 200 bytes) files
```
$ idgrep -l 200 --file-id-count
```
List all identifiers beginning with a or A, sort by size.
```
$ idgrep -i --sort-by-size -- 'a*'
```
Find all python (.py) files in current directory without descending into directories, looking for cat.
```
$ idgrep -p '*.py' -- cat
```
Find all python (.py) files in current directory and descending into directories, looking for anything not being cat.
```
$ idgrep -p '**/*.py' -- '~cat'
```
Find files with identifiers including data, group by identifier
```
$ idgrep --group-by-id -- '*data*'
```

Notes about shells

Note that in many shells the following characters needs escaping: !, (, ), &, |, *, ?

The easiest option is to put the entire query within single (') or double (") quotation marks.

Usage

usage: idgrep [-p FILE_PATTERN] [-l LIMIT_SIZE] [-d LIMIT_IDENTIFIERS] [-i]
              [--help | --file-id-count] [--group-by-id]
              [--sort-by-name | --sort-by-count | --sort-by-size]
              [--ascending | --descending] [paths ...] -- query

positional arguments:
  [paths] -- query

options:
  -p FILE_PATTERN, --file-pattern FILE_PATTERN
  -l LIMIT_SIZE, --limit-size LIMIT_SIZE
  -d LIMIT_IDENTIFIERS, --limit-identifiers LIMIT_IDENTIFIERS
  -i
  --help
  --file-id-count
  --group-by-id
  --sort-by-name
  --sort-by-count
  --sort-by-size
  --ascending, --asc
  --descending, --desc

Details

-p | --file-pattern

The file pattern is processed using the glob¹ function of pathlib.

-l | --limit-size

Sets a maximum size limit for files to prevent processing large files. This is enabled by default and set to one megabyte (1M).

The following suffixes are recognized:

suffix	size
k	2¹⁰ bytes
m	2²⁰ bytes
g	2³⁰ bytes
t	2⁴⁰ bytes

-d | --limit-identifiers

Sets the maximum identifier count limit for files to prevent processing files with too many identifiers. This is enabled by default and set to 1k.

The following suffixes are recognized:

suffix	count
k	10³ identifiers
m	10⁶ identifiers

-i

Ignore case in matching.

--help

Shows the output shown above under Usage.

--file-id-count

Counts number of identifiers in all matching files. Any query entered will be ignored.

--group-by-id

Group output by identifier. If a file is matching multiple identifiers it will be listed multiple times.

--sort-by-name | --sort-by-count | --sort-by-size

Select sort key for output.

--ascending | --asc | --descending | --desc

Select sort order.

paths

The paths to perform search in. Currently exclusion is not supported but will be added in a later version.

query

The query is added last after a double dash. This double dash is used to separate file arguments from the query. If you wish to specify a file named double dash, use ./-- for it.

Query Format

~identifier

Do not match this identifier (logical not)
!identifier

Do not match this identifier (logical not)
expr_1 & expr_2

Match both expr_1 and expr_2 (and)
expr_1 | expr_2

Match either expr_1 or expr_2 or both together (inclusive or)
expr_1 ^ expr_2

Match either expr_1 or expr_2 but not both together (exclusive or)
(subexpression)

Use parenthesis to define subexpressions. This is useful for queries such as:

(fish | birds) & (chips | bees)

See https://docs.python.org/3/library/pathlib.html?highlight=path#pathlib.Path.glob for more information. ↩

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.editorconfig		.editorconfig
LICENSE		LICENSE
README.md		README.md
idgrep.py		idgrep.py
query_parser.py		query_parser.py
query_set.py		query_set.py
query_set_test.py		query_set_test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

idgrep

Examples

Notes about shells

Usage

Details

-p | --file-pattern

-l | --limit-size

-d | --limit-identifiers

-i

--help

--file-id-count

--group-by-id

--sort-by-name | --sort-by-count | --sort-by-size

--ascending | --asc | --descending | --desc

paths

query

Query Format

About

Releases

Packages

Languages

License

JohnnyAmos/idgrep

Folders and files

Latest commit

History

Repository files navigation

idgrep

Examples

Notes about shells

Usage

Details

-p | --file-pattern

-l | --limit-size

-d | --limit-identifiers

-i

--help

--file-id-count

--group-by-id

--sort-by-name | --sort-by-count | --sort-by-size

--ascending | --asc | --descending | --desc

paths

query

Query Format

Footnotes

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages