Linux's fastest text search tool ripgrep (the best alternative to grep)

Linux's fastest text search tool ripgrep (the best alternative to grep)

Preface

Speaking of text search tools, everyone must know grep, which is one of the most useful and commonly used tools in Linux.
But if you want to search for a keyword in a large engineering project, you must know that it is time-consuming.
So there are many alternative tools, the most famous ones are Ack, Ag
Recently, a new alternative, Ripgrep, has emerged. This tool uses multi-threading like Ack/Ag, but rg is faster than them.

Introduction

ripgrep is a line-based search tool that recursively searches the specified directory for the provided pattern. It is written in Rust language and is incredibly fast compared to similar tools.
Several features are as follows:

  • Automatic recursive search (grep requires -R)
  • Automatically ignore files in .gitignore and binary files
  • You can search for a specific file type (rg -tpy foo limits python files, rg -Tjs foo excludes js files)
  • Supports most grep features (all commonly used ones)
  • Supports various file compilations (UTF-8, UTF-16, latin-1, GBK, EUC-JP, Shift_JIS, etc.)
  • Supports searching common compressed files (gzip, xz, lzma, bzip2, lz4)
  • Automatically highlight matching results
  • Shorter command names rg (grep is four characters)
  • Does not support multi-line searches and fancy regular expressions

Install ripgrep

Install RUST first

curl https://sh.rustup.rs -sSf | sh

Then just press enter.

Install rigpre ​​with RUST

git clone https://github.com/BurntSushi/ripgrep
cd ripgrep
cargo build --release
cp ./target/release/rg /usr/local/bin/

The last step is to put it in a path in PATH according to your situation.

use

Search results display

Usage General Format

USAGE:
  rg [OPTIONS] PATTERN [PATH ...]
  rg [OPTIONS] [-e PATTERN ...] [-f PATTERNFILE ...] [PATH ...]
  rg [OPTIONS] --files [PATH ...]
  rg [OPTIONS] --type-list
  command | rg [OPTIONS] PATTERN

Input Parameters

ARGS:
  <PATTERN>
      A regular expression used for searching. To match a pattern beginning with a
      dash, use the -e/--regexp flag.

      For example, to search for the literal '-foo', you can use this flag:

        rg -e -foo

      You can also use the special '--' delimiter to indicate that no more flags
      will be provided. Namely, the following is equivalent to the above:

        rg -- -foo

  <PATH>...
      A file or directory to search. Directories are searched recursively. Paths specified on
      the command line override glob and ignore rules.

options Description other
-A, --after-context <NUM> Show <NUM> lines after the matching content Will override --context
-B, --before-context <NUM> Show <NUM> lines before the matching content Will override --context
-b, --byte-offset Displays the byte offset of the matching content in the file Use with -o to print only the offset
-s, --case-sensitive Case sensitivity Will override -i (ignore case), -S (smart case)
--color <WHEN> When to use color, default is auto If --vimgre is used, the default value is never
The options are: never, auto, always, ansi
--colors <COLOR_SPEC>... Set the output color: color: red, blue, green, cyan
{type}:{attribute}:{value} magenta, yellow, white, black
{type} : path, line, column, match style: nobold, bold, nointense
{attribute} : fg, bg, style intense, nounderline, underline
{value} : a color or a text style Example:
{type}:none will clear the color setting of {type} rg --colors 'match:fg:magenta' --colors 'line:bg:yellow' foo
The extended color set can be used by {value} if the terminal supports ANSI color
The description method is 'x' (256-color) or 'x,x,x' (24-bit true color)
x is a value between 0 and 255, the default is decimal, and the prefix 0x is hexadecimal
For example: rg --colors 'match:bg:0,128,255'
or equivalently: rg --colors 'match:bg:0x0,0x80,0xFF'
Intense and nointense are invalid when using extended color code.
--column The column number of the first match (starting from 1) Can be disabled by --no-column
-C, --context <NUM> Show <NUM> lines before and after the match This overrides the -B and -A options.
--context-separator <SEPARATOR> Used to separate non-consecutive lines in output x7F or t can be used, the default is --
-c, --count Show only matching lines If there is only one file given to ripgrep, only the number of matching lines will be printed.
You can use --with-filename to force printing of the file name.
This overrides the --count-matches option.
--count-matches Only show the number of matches You can use --with-file to force output of the file name when there is only one file.
--debug Display debug information
--dfa-size-limit <NUM+SUFFIX?> The upper limit of regex DFA, default is 10M
-E, --encoding <ENCODING> Description text encoding, default is auto https://encoding.spec.whatwg.org/#concept-encoding-get
-f, --file <PATTERNFILE>... Read pattern from file, one pattern per line Can be used multiple times or in combination with -e, so any combination will be matched
--files Print all files that will be searched Use with rg <options> --files [PATH...] , without pattern
-l, --files-with-matches Print only the file names that match Override --files-without-match
--files-without-match Print only the file names that do not match Override --file-with-matches
-F, --fixed-strings Treat pattern as regular text instead of regex This option can be disabled with --no-fixed-strings
-L, --follow Will recursively search for links, closed by default You can use --no-follow to turn it off
-g, --glob <GLOB>... Wildcard files or folders can be negated with ! Can be used multiple times, will match the wildcard rules of .gitignore
-h, --help Print help information
--heading Print the file name above the matching content instead of on the same line This is the default behavior and can be disabled with --no-heading
--hidden Search hidden files and folders Ignored by default, can be disabled with --no-hidden
--iglob <GLOB>... Same as --glob, but case-insensitive
-i, --ignore-case pattern is case insensitive This option can be overridden by -s/--case-sensitive or -S/--smart-case
--ignore-file <PATH>... Ignore paths, the format is the same as .gitignore, multiple paths are allowed When there are multiple --ignore-file tags, the latter one takes precedence.
When on the command, use -g to achieve the same effect
-v, --invert-match Reverse Match
-n, --line-number Display file line number, open by default
-x, --line-regexp Only display lines that completely match pattern Will override --word-regexp
-M, --max-columns <NUM> Do not print matching lines longer than <NUM> columns
-m, --max-count <NUM> Limit a file to a maximum of <NUM> lines.
--max-depth <NUM> Limit the depth of recursive folder searches rg --max-depth 0 dir/ Do not perform any search
--max-filesize <NUM+SUFFIX?> Ignore files larger than <NUM> bytes suffix can be K, M, G, the default is byte
--mmap Try to use memory maps, default behavior Currently it does not support all options, use --no-mmap to disable
--no-config Do not read conf files, ignore RIPGREP_CONFIG_PATH
--no-filename Do not print matching file names
--no-heading Print the file name before each matching line.
--no-ignore Cancel ignore files, such as .gitignore, .ignore Can be turned off with --ignore
--no-ignore-global Cancel reading the global ignore file For example, $HOME/.config/git/ignore
--no-ignore-messages Cancel parsing errors related to .ignore and .gitignore files Can be disabled with --ignore-messages
--no-ignore-parent Do not read .gitignore, .ignore files in parent folders Can be disabled with --ignore-parent
--no-ignore-vcs Only the catch-all .ignore file Can be turned off with --ignore-vcs
-N, --no-line-number Do not print matching line numbers
--no-messages Do not print errors related to opening and reading files
-0, --null Add a NUL character after the printed file path Very useful for xargs
-o, --only-matching Print only the matched content, not the entire line
--passthru Print matching and non-matching lines
--path-separator <SEPARATOR> Path separator, default is / on Linux
--pre <COMMAND> Process the file with <COMMAND> and pass the result to rg There may be a huge performance penalty
For example
case "$1" in
*.pdf)
exec pdftotext "$1" -
;;
*)
case $(file "$1") in
_Zstandard_)
exec pzstd -cdq
;;
*)
exec cat
;;
esac
;;
esac
-p, --pretty --color always --heading --line-number
-q, --quiet Do not print to stdout, if a match is found, stop rg This is useful when rg is used in exit code.
--regex-size-limit <NUM+SUFFIX?> Upper limit of compiled regex
-e, --regexp <PATTERN>... Use regular expressions to match This option can be used multiple times to print lines matching any pattern.
Can be used to search for patterns starting with -, such as rg -e -foo
-r, --replace <REPLACEMENT_TEXT> Print out the matching content with the corresponding file The group number ($5) can be used
-z, --search-zip Search in gz,bz2,xz,lzma,lz4 file types Can be disabled with --no-search-zip
-S, --smart-case If all lowercase, then it is case insensitive, otherwise it is case sensitive Can be turned off with -s/--case-sensitive and -i/--ignore-case
--sort-files Sort the output results by file path Will turn off parallel search threads
--stats Print out the statistical results
-a, --text Searching for binary files Can be disabled with --no-text
-j, --threads <NUM> Approximate number of threads used
-t, --type <TYPE>... Search only for certain file types You can use --type-lsit to list supported file types.
--type-add <TYPE_SPEC>... Adding a file type For example, rg --type-add 'foo:*.foo' -tfoo PATTERN
Can also be used to create a rule that includes multiple file types --type-add 'src:include:cpp,py,md'
--type-clear <TYPE>... Clear default file types
--type-list List all built-in file types
-T, --type-not <TYPE>... Do not search for a certain file type
-u, --unrestricted -u searches for files in .gitignore, -uu searches for hidden files -uuu Search for binary files
-V, --version Print version information
--vimgrep Print one line per match If there are multiple matches on a line, multiple lines will be printed.
-H, --with-filename Print matching file paths, default Can be disabled with --no-filename
-w, --word-regexp Matches pattern as a single word, equivalent to < >

Example display

Example 1

$ rg 'name' ./ 

Example 2

Search for content where name is an independent word (-w), equivalent to <pattern>

$ rg -w 'name' ./ 

Example 3

Print only file names containing matching content (-l)

$ rg -w 'name' ./ -l
src/cpp/epoll_server.cpp
src/cpp/uart_xtor.cpp

Example 4

Only search for cpp files (-t), you can use -T to exclude certain types of files

$ rg -w 'name' ./ -tcpp 

Example 5

Regular search (-e)

$ rg -e "sa.*port" ./ -tcpp 

Example 6

Display the matching content and two lines above and below (-C), similar to -A/-B

$ rg -e "sa.*port" ./ -tcpp -C2 

Example 7

Show lines without "debug" (-v)

$ rg -v "debug" -tcpp ./

Example 8

Show only matching parts (-o)

$ rg -e "if.*debug" ./ -tcpp -o 

Example 9

Ignore case (-i)

$ rg -ie "if.*debug" ./ -tcpp -o 

Example 10

Treat pattern as a constant character (-F), like .(){}*+ does not need escape. If the character to be searched starts with -, then use -- as a separator, or use rg -e "-foo"

rg -F "i++)" ./ -tcpp 

Example 11

Print all files that will be searched --files

rg --files 

Example 12

Output built-in recognition file type

$ rg --type-list
agda: *.agda, *.lagda
aidl: *.aidl
amake: *.bp, *.mk
asciidoc: *.adoc, *.asc, *.asciidoc
asm: *.S, *.asm, *.s
ats: *.ats, *.dats, *.hats, *.sats
avro: *.avdl, *.avpr, *.avsc
awk: *.awk
bazel: *.bzl, BUILD, WORKSPACE
bitbake: *.bb, *.bbappend, *.bbclass, *.conf, *.inc
bzip2: *.bz2
c: *.H, *.c, *.cats, *.h
cabal: *.cabal
cbor: *.cbor
ceylon: *.ceylon
clojure: *.clj, *.cljc, *.cljs, *.cljx
cmake: *.cmake, CMakeLists.txt
coffeescript: *.coffee
config: *.cfg, *.conf, *.config, *.ini
cpp: *.C, *.H, *.cc, *.cpp, *.cxx, *.h, *.hh, *.hpp, *.hxx, *.inl
creole: *.creole
crystal: *.cr, Projectfile
cs: *.cs
csharp: *.cs
cshtml: *.cshtml
css: *.css, *.scss
csv: *.csv
cython: *.pyx
d: *.d
dart: *.dart
dhall: *.dhall
docker: *Dockerfile*
elisp: *.el
elixir: *.eex, *.ex, *.exs
elm: *.elm
erlang: *.erl, *.hrl
fidl: *.fidl
fish: *.fish
fortran: *.F, *.F77, *.F90, *.F95, *.f, *.f77, *.f90, *.f95, *.pfo
fsharp: *.fs, *.fsi, *.fsx
gn: *.gn, *.gni
go: *.go
groovy: *.gradle, *.groovy
gzip: *.gz
h: *.h, *.hpp
haskell: *.c2hs, *.cpphs, *.hs, *.hsc, *.lhs
hbs: *.hbs
hs: *.hs, *.lhs
html: *.ejs, *.htm, *.html
idris: *.idr, *.lidr
java: *.java, *.jsp
jinja: *.j2, *.jinja, *.jinja2
jl: *.jl
js: *.js, *.jsx, *.vue
json: *.json, composer.lock
jsonl: *.jsonl
julia: *.jl
jupyter: *.ipynb, *.jpynb
kotlin: *.kt, *.kts
less: *.less
license: *[.-]LICEN[CS]E*, AGPL-*[0-9]*, APACHE-*[0-9]*, BSD-*[0-9]*, CC-BY-*, COPYING, COPYING[.-]*, COPYRIGHT, COPYRIGHT[.-]*, EULA, EULA[.-]*, GFDL-*[0-9]*, GNU-*[0-9]*, GPL-*[0-9]*, LGPL-*[0-9]*, LICEN[CS]E, LICEN[CS]E[.-]*, MIT-*[0-9]*, MPL-*[0-9]*, NOTICE, NOTICE[.-]*, OFL-*[0-9]*, PATENTS, PATENTS[.-]*, UNLICEN[CS]E, UNLICEN[CS]E[.-]*, agpl[.-]*, gpl[.-]*, lgpl[.-]*, licen[cs]e, licen[cs]e.*
lisp: *.el, *.jl, *.lisp, *.lsp, *.sc, *.scm
log: *.log
lua: *.lua
lz4: *.lz4
lzma: *.lzma
m4: *.ac, *.m4
make: *.mak, *.mk, GNUmakefile, Gnumakefile, Makefile, gnumakefile, makefile
man: *.[0-9][cEFMmpSx], *.[0-9lnpx]
markdown: *.markdown, *.md, *.mdown, *.mkdn
matlab: *.m
md: *.markdown, *.md, *.mdown, *.mkdn
mk: mkfile
ml: *.ml
msbuild: *.csproj, *.fsproj, *.proj, *.props, *.targets, *.vcxproj
nim: *.nim
nix: *.nix
objc: *.h, *.m
objcpp: *.h, *.mm
ocaml: *.ml, *.mli, *.mll, *.mly
org: *.org
pdf: *.pdf
perl: *.PL, *.perl, *.pl, *.plh, *.plx, *.pm, *.t
php: *.php, *.php3, *.php4, *.php5, *.phtml
pod: *.pod
protobuf: *.proto
ps: *.cdxml, *.ps1, *.ps1xml, *.psd1, *.psm1
puppet: *.erb, *.pp, *.rb
purs: *.purs
py: *.py
qmake: *.prf, *.pri, *.pro
r: *.R, *.Rmd, *.Rnw, *.r
rdoc: *.rdoc
readme: *README, README*
rst: *.rst
ruby: *.gemspec, *.rb, .irbrc, Gemfile, Rakefile
rust: *.rs
sass: *.sass, *.scss
scala: *.sbt, *.scala
sh: *.bash, *.bashrc, *.csh, *.cshrc, *.ksh, *.kshrc, *.sh, *.tcsh, *.zsh, .bash_login, .bash_logout, .bash_profile, .bashrc, .cshrc, .kshrc, .login, .logout, .profile, .tcshrc, .zlogin, .zlogout, .zprofile, .zshenv, .zshrc, bash_login, bash_logout, bash_profile, bashrc, profile, zlogin, zlogout, zprofile, zshenv, zshrc
smarty: *.tpl
sml: *.sig, *.sml
soy: *.soy
spark: *.spark
sql: *.psql, *.sql
stylus: *.styl
sv: *.h, *.sv, *.svh, *.v, *.vg
svg: *.svg
swift: *.swift
swig: *.def, *.i
systemd: *.automount, *.conf, *.device, *.link, *.mount, *.path, *.scope, *.service, *.slice, *.socket, *.swap, *.target, *.timer
taskpaper: *.taskpaper
tcl: *.tcl
tex: *.bib, *.cls, *.ltx, *.sty, *.tex
textile: *.textile
tf: *.tf
toml: *.toml, Cargo.lock
ts: *.ts, *.tsx
twig: *.twig
txt: *.txt
vala: *.vala
vb: *.vb
verilog: *.sv, *.svh, *.v, *.vh
vhdl: *.vhd, *.vhdl
vim: *.vim
vimscript: *.vim
webidl: *.idl, *.webidl, *.widl
wiki: *.mediawiki, *.wiki
xml: *.xml, *.xml.dist
xz: *.xz
yacc: *.y
yaml: *.yaml, *.yml
zsh: *.zsh, .zlogin, .zlogout, .zprofile, .zshenv, .zshrc, zlogin, zlogout, zprofile, zshenv, zshrc

Summarize

ripgrep's search speed is really fast, which helps me a lot when browsing code. I believe that its value to every programmer is infinite, especially when combined with FZF.
The only weakness is the support for regular expressions, but this is a trade-off. If a library like PCRE is used, the speed will definitely be greatly affected.

The above is the full content of this article. I hope it will be helpful for everyone’s study. I also hope that everyone will support 123WORDPRESS.COM.

You may also be interested in:
  • Problems with dynamic link library loading path and search path under Linux
  • Detailed examples of file search commands on Linux
  • The difference between Linux command search commands whereis and which
  • Search Files Command in Linux
  • Introduction to Linux file search and other basic commands (3)
  • Use Linux regular expressions to flexibly search for text in files
  • How to view the behavior of search engine spider crawlers in Linux/Nginx

<<:  Detailed explanation of the installation steps of the MySQL decompressed version

>>:  MySQL Binlog Data Recovery: Detailed Explanation of Accidentally Deleting a Database

Recommend

Detailed instructions for installing mysql5.7 database under centos7.2

The mysql on the server is installed with version...

Vue echarts realizes horizontal bar chart

This article shares the specific code of vue echa...

Detailed explanation of the getBoundingClientRect() method in js

1. getBoundingClientRect() Analysis The getBoundi...

How to view and close background running programs in Linux

1. Run the .sh file You can run it directly using...

Get the IP and host name of all hosts on Zabbix

zabbix Zabbix ([`zæbiks]) is an enterprise-level ...

Nginx routing forwarding and reverse proxy location configuration implementation

Three ways to configure Nginx The first method di...

setup+ref+reactive implements vue3 responsiveness

Setup is used to write combined APIs. The interna...

MySQL cross-database transaction XA operation example

This article uses an example to describe the MySQ...

How to use negative margin technology to achieve average layout in CSS

We usually use float layout to solve the compatib...

Docker+selenium method to realize automatic health reporting

This article takes the health reporting system of...

How to set mysql to case insensitive

mysql set to case insensitive Windows Go to the d...

Tutorial on installing and using virtualenv in Deepin

virtualenv is a tool for creating isolated Python...