查询语法
了解如何使用查询语法
基本语法
您可以使用以下规则对复杂查询使用简单语法:
-
精确短语用引号括起来,例如
"hello world"
. -
多词短语是标记列表,例如
foo bar baz
,并暗示项的交集 (AND)。 -
OR
联合用竖线 (|
) 字符,例如hello|hallo|shalom|hola
.注意:
考虑示例中解析器行为的差异
hello world | "goodbye" moon
:- 在 DIALECT 1 中,此查询被解释为搜索
(hello world | "goodbye") moon
. - 在 DIALECT 2 或更高版本中,此查询被解释为搜索
hello world
或"goodbye" moon
.
- 在 DIALECT 1 中,此查询被解释为搜索
-
NOT
表达式或子查询的否定用减法符号 () 表示,例如,-
hello -world
.纯否定查询,例如-foo
和-@title:(foo|bar)
也受支持。注意:
考虑一个带有否定的简单查询
-hello world
:- 在 DIALECT 1 中,此查询被解释为“在不包含
hello
AND 不包含world
".等效项为-(hello world)
或-hello -world
. - 在 DIALECT 2 或更高版本中,将解释此查询
as -hello
和world
(仅限hello
是否定的)。 - 在 DIALECT 2 或更高版本中,要实现 DIALECT 1 的默认行为,请将查询更新为
-(hello world)
.
- 在 DIALECT 1 中,此查询被解释为“在不包含
-
前缀/中缀/后缀匹配(所有以术语开头/包含/结尾的术语)都用星号表示。出于性能原因,强制实施最短期限长度。默认值为 2,但可配置。
*
-
在 DIALECT 2 或更高版本中,通配符模式匹配表示为
"w'foo*bar?'"
.请注意,使用双引号来包含 w 模式。 -
返回 index 中所有结果的特殊通配符查询只是 asterisk 。这不能与其他选项结合使用。
*
-
自 v2.6.1 起,
DIALECT 3
返回 JSON,而不是来自多值属性的标量。 -
使用语法选择特定字段
hello @field:world
. -
使用语法
@field:[{min} {max}]
. -
Georadius 对具有语法的 geo 字段进行匹配
@field:[{lon} {lat} {radius} {m|km|mi|ft}]
. -
从 2.6 开始,对向量字段进行范围查询的语法为
@field:[VECTOR_RANGE {radius} $query_vec]
哪里query_vec
作为查询参数给出。 -
从 v2.4 开始,k-最近邻 (KNN) 对向量字段进行查询,无论是否使用语法
{filter_query}=>[KNN {num} @field $query_vec]
. -
使用语法的标记字段过滤器
@field:{tag | tag | ...}
.请参阅有关标记的完整文档。 -
可选条款或条款:
foo ~bar
表示 bar 是可选的,但文档包含bar
将排名更高。 -
术语的模糊匹配:
%hello%
表示 Levenshtein 距离为 1 的所有项。使用多对 '%' 括号,最多三对,以增加 Levenshtein 距离。 -
查询中的表达式可以括在括号中以消除歧义,例如,
(hello|hella) (world|werld)
. -
查询属性可以应用于单个子句,例如
(foo bar) => { $weight: 2.0; $slop: 1; $inorder: false; }
. -
以上组合可以一起使用,例如:
hello (world|foo) "bar baz" bbbb
.
纯否定查询
从 v0.19.3 开始,查询可以只包含一个否定表达式。例如-hello
或-(@title:(foo|bar))
.结果是所有不包含查询词的文档。
字段修饰符
您可以在查询中指定字段修饰符,而不仅仅是使用INFIELDS
global 关键字。
要指定查询匹配的字段,请在表达式前面加上符号、字段名称和@
:
(冒号) 符号,对于每个表达式或子表达式。
如果字段修饰符位于多个单词或表达式之前,则它仅适用于具有 DIALECT 1 的相邻表达式。使用 DIALECT 2 或更高版本,可以将查询扩展到其他字段。
请考虑以下简单查询:@name:James Brown
.此处,字段修饰符@name
后面跟着两个字:James
和Brown
.
- 在 DIALECT 1 中,此查询将解释为“查找
James Brown
在@name
字段”。 - 在 DIALECT 2 或更高版本中,此查询将被解释为“find
James
在@name
field 和Brown
在 ANY 文本字段中。换句话说,它将被解释为(@name:James) Brown
. - 在 DIALECT 2 或更高版本中,要实现 DIALECT 1 的默认行为,请将查询更新为
@name:(James Brown)
.
如果字段修饰符位于括号中的表达式之前,则它仅适用于括号内的表达式。表达式应对指定字段有效,否则将跳过该表达式。
要对多个字段创建复杂的筛选,您可以组合多个修饰符。例如,如果您有一个汽车型号索引,其中包含车辆类别、原产国和发动机类型,则可以使用以下查询搜索韩国制造的混合动力或柴油发动机 SUV:
FT.SEARCH cars "@country:korea @engine:(diesel|hybrid) @class:suv"
您可以将多个修饰符应用于同一术语或分组术语:
FT.SEARCH idx "@title|body:(hello world) @url|image:mydomain"
现在,您可以搜索具有"hello"
和"world"
在正文或标题和术语中mydomain
在他们的url
或image
领域。
查询中的数值筛选器
如果架构中的字段定义为 NUMERIC,则可以在 Redis 请求中使用 FILTER 参数,也可以通过在查询中指定筛选规则来使用 FILTER 参数。语法为@field:[{min} {max}]
例如@price:[100 200]
.
关于数字谓词的几点说明
-
可以将数字谓词指定为整个查询,而无法使用
FILTER
论点。 -
可以在同一查询中交叉或合并多个数字过滤器,无论是针对同一字段还是不同字段。
-
-inf
,inf
和+inf
是范围内可接受的数字。因此,大于 100 表示为[(100 inf]
. -
数字筛选器是非独占的。不包括 min 或 max 在数字前加上前缀,例如
(
[(100 (200]
. -
可以通过在过滤器前面加上一个符号来否定数字过滤器。例如,返回 price 不同于 100 的结果表示为:
-
@title:foo -@price:[100 100]
.
标签过滤器
从 v0.91 开始,您可以使用一种称为 tag 字段的特殊字段类型,在索引中使用更简单的分词和编码。您无法使用常规无字段搜索来访问这些字段中的值。相反,您可以使用特殊语法:
@field:{ tag | tag | ...}
例:
@cities:{ New York | Los Angeles | Barcelona }
标记可以包含多个单词,也可以包含除字段分隔符 (,
默认情况下)。标签中的以下字符应使用反斜杠 () 进行转义: 、 、 、 和\
$
{
}
\
|
.
DIALECT 2
或更大,您可以在tag
query 中,即使使用停用词也是如此。请注意,同一子句中的多个标记会创建包含任一标记的文档的联合。要创建包含所有标签的文档的交集,您应该多次重复标签过滤器。例如:
# Return all documents containing all three cities as tags
@cities:{ New York } @cities:{Los Angeles} @cities:{ Barcelona }
# Now, return all documents containing either city
@cities:{ New York | Los Angeles | Barcelona }
标记子句可以组合成任何子子句,用作否定表达式、可选表达式等。
地理过滤器
从 v0.21 开始,可以使用语法将 geo radius 查询直接添加到查询语言中@field:[{lon} {lat} {radius} {m|km|mi|ft}]
.这会将结果从以米、千米、英里或英尺定义的 lon,lat 点筛选为给定半径。查看 Redis 自己的GEORADIUS
命令了解更多详情。
半径过滤器可以像数字过滤器一样添加到查询中。例如,在企业数据库中,寻找旧金山附近(半径 5 公里内)的中餐馆将表示为:chinese restaurant @location:[-122.41 37.77 5 km]
.
多边形搜索
地理空间数据库对于管理和分析各种行业中基于位置的数据至关重要。它们帮助组织做出数据驱动的决策、优化运营并更高效地实现其战略目标。多边形搜索扩展了 Redis 的地理空间搜索功能,以便能够查询GEOSHAPE
属性。此值必须遵循几何图形的“已知文本”(WKT) 表示形式。支持两种这样的几何图形:
POINT
例如POINT(2 4)
.POLYGON
例如POLYGON((2 2, 2 8, 6 11, 10 8, 10 2, 2 2))
.
有一个名为GEOSHAPE
,可以指定为:
FLAT
对于笛卡尔 X Y 坐标SPHERICAL
以获取地理经度和纬度坐标。这是默认坐标系。
最后,还有新的FT.SEARCH
语法,用于查询包含或位于给定 GeoShape 中的多边形。
@field:[{WITHIN|CONTAINS} $geometry] PARAMS 2 geometry {geometry}
下面是一个使用两个堆叠多边形的示例,这两个多边形表示房屋中包含的一个框。

首先,使用FLAT
GEOSHAPE
,表示 2D X Y 坐标系。
FT.CREATE polygon_idx PREFIX 1 shape: SCHEMA g GEOSHAPE FLAT t TEXT
接下来,创建表示图片中几何图形的数据结构。
HSET shape:1 t "this is my house" g "POLYGON((2 2, 2 8, 6 11, 10 8, 10 2, 2 2))"
HSET shape:2 t "this is a square in my house" g "POLYGON((4 4, 4 6, 6 6, 6 4, 4 4))"
Finally, use
FT.SEARCH
to query the geometries. Note the use of DIALECT 3
, which is required. Here are a few examples.
Search for a polygon that contains a specified point:
FT.SEARCH polygon_idx "@g:[CONTAINS $point]" PARAMS 2 point 'POINT(8 8)' DIALECT 3
1) (integer) 1
2) "shape:1"
3) 1) "t"
2) "this is my house"
3) "g"
4) "POLYGON((2 2, 2 8, 6 11, 10 8, 10 2, 2 2))"
Search for geometries contained in a specified polygon:
FT.SEARCH polygon_idx "@g:[WITHIN $poly]" PARAMS 2 poly 'POLYGON((0 0, 0 100, 100 100, 100 0, 0 0))' DIALECT 3
1) (integer) 2
2) "shape:2"
3) 1) "t"
2) "this is a square in my house"
3) "g"
4) "POLYGON((4 4, 4 6, 6 6, 6 4, 4 4))"
4) "shape:1"
5) 1) "t"
2) "this is my house"
3) "g"
4) "POLYGON((2 2, 2 8, 6 11, 10 8, 10 2, 2 2))"
Search for a polygon that is not contained in the indexed geometries:
FT.SEARCH polygon_idx "@g:[CONTAINS $poly]" PARAMS 2 poly 'POLYGON((14 4, 14 6, 16 6, 16 4, 14 4))' DIALECT 3
1) (integer) 0
Search for a polygon that is known to be contained within the geometries (the box):
FT.SEARCH polygon_idx "@g:[CONTAINS $poly]" PARAMS 2 poly 'POLYGON((4 4, 4 6, 6 6, 6 4, 4 4))' DIALECT 3
1) (integer) 2
2) "shape:1"
3) 1) "t"
2) "this is my house"
3) "g"
4) "POLYGON((2 2, 2 8, 6 11, 10 8, 10 2, 2 2))"
4) "shape:2"
5) 1) "t"
2) "this is a square in my house"
3) "g"
4) "POLYGON((4 4, 4 6, 6 6, 6 4, 4 4))"
Note that both the house and box shapes were returned.
Note:
GEOSHAPE does not support JSON multi-value or SORTABLE options.
For more examples, see the FT.CREATE
and FT.SEARCH
command pages.
Vector search
You can add vector similarity queries directly into the query language by:
-
Using a range query with the syntax of @vector:[VECTOR_RANGE {radius} $query_vec]
, which filters the results to a given radius from a given query vector. The distance metric derives from the definition of a @vector field in the index schema, for example, Cosine or L2 (as of v2.6.1).
-
Running a k nearest neighbors (KNN) query on a @vector field. The basic syntax is "*=>[ KNN {num|$num} @vector $query_vec ]"
.
It is also possible to run a hybrid query on filtered results. A hybrid query allows the user to specify a filter criteria that all results in a KNN query must satisfy. The filter criteria can include any type of field (i.e., indexes created on both vectors and other values, such as TEXT, PHONETIC, NUMERIC, GEO, etc.).
The general syntax for hybrid query is {some filter query}=>[ KNN {num|$num} @vector $query_vec]
, where =>
separates the filter query from the vector KNN query.
Examples:
-
Return 10 nearest neighbors entities in which query_vec
is closest to the vector stored in @vector_field
:
*=>[KNN 10 @vector_field $query_vec]
-
Among entities published between 2020 and 2022, return 10 nearest neighbors entities in which query_vec
is closest to the vector stored in @vector_field
:
@published_year:[2020 2022]=>[KNN 10 @vector_field $query_vec]
-
Return every entity for which the distance between the vector stored under its @vector_field and query_vec
is at most 0.5, in terms of the @vector_field distance metric:
@vector_field:[VECTOR_RANGE 0.5 $query_vec]
As of v2.4, the KNN vector search can be used at most once in a query, while, as of v2.6, the vector range filter can be used multiple times in a query. For more information on vector similarity syntax, see Querying vector fields, and Vector search examples sections.
Prefix matching
When indexes are updated, Redis maintains a dictionary of all terms in the index. This can be used to match all terms starting with a given prefix. Selecting prefix matches is done by appending *
to a prefix token. For example:
hel* world
Will be expanded to cover (hello|help|helm|...) world
.
A few notes on prefix searches
-
As prefixes can be expanded into many terms, use them with caution. The expansion will create a Union operation of all suffixes.
-
As a protective measure to avoid selecting too many terms, blocking Redis, which is single threaded, there are two limitations on prefix matching:
-
Prefixes are limited to 2 letters or more. You can change this number by using the MINPREFIX
setting on the module command line.
-
The minimum word length to stem is 4 letters or more. You can change this number by using the MINSTEMLEN
setting on the module command line.
-
Expansion is limited to 200 terms or less. You can change this number by using the MAXEXPANSIONS
setting on the module command line.
-
Prefix matching fully supports Unicode and is case insensitive.
-
Currently, there is no sorting or bias based on suffix popularity.
Infix/suffix matching
As of v2.6.0, the dictionary can be used for infix (contains) or suffix queries by appending *
to the token. For example:
*sun* *ing
These queries are CPU intensive because they require iteration over the whole dictionary.
Note:
All notes about prefix searches also apply to infix/suffix queries.
Using a suffix trie
A suffix trie maintains a list of terms that match the suffix. If you add a suffix trie to a field using the WITHSUFFIXTRIE
keyword, you can create more efficient infix and suffix queries because it eliminates the need to iterate over the whole dictionary. However, the iteration on the union does not change.
Suffix queries create a union of the list of terms from the suffix term node. Infix queries use the suffix terms as prefixes to the trie and create a union of all terms from all matching nodes.
Wildcard matching
As of v2.6.0, you can use the dictionary for wildcard matching queries with these parameters.
?
- for any single character
*
- for any character repeating zero or more times
- ' and \ - for escaping; other special characters are ignored
An example of the syntax is "w'foo*bar?'"
.
Using a suffix trie
A suffix trie maintains a list of terms which match the suffix. If you add a suffix trie to a field using the WITHSUFFIXTRIE
keyword, you can create more efficient wildcard matching queries because it eliminates the need to iterate over the whole dictionary. However, the iteration on the union does not change.
With a suffix trie, the wildcard pattern is broken into tokens at every *
character. A heuristic is used to choose the token with the least terms, and each term is matched with the wildcard pattern.
Fuzzy matching
As of v1.2.0, the dictionary of all terms in the index can also be used to perform fuzzy matching.
Fuzzy matches are performed based on Levenshtein distance (LD).
Fuzzy matching on a term is performed by surrounding the term with '%', for example:
%hello% world
This performs fuzzy matching on hello
for all terms where LD is 1.
As of v1.4.0, the LD of the fuzzy match can be set by the number of '%' characters surrounding it, so that %%hello%%
will perform fuzzy matching on 'hello' for all terms where LD is 2.
The maximum LD for fuzzy matching is 3.
Wildcard queries
As of v1.1.0, you can use a special query to retrieve all the documents in an index. This is meant mostly for the aggregation engine. You can call it by specifying only a single star sign as the query string, in other words, FT.SEARCH myIndex *
.
You can't combine this with any other filters, field modifiers, or anything inside the query. It is technically possible to use the deprecated FILTER
and GEOFILTER
request parameters outside the query string in conjunction with a wildcard, but this makes the wildcard meaningless and only hurts performance.
Query attributes
As of v1.2.0, you can apply specific query modifying attributes to specific clauses of the query.
The syntax is (foo bar) => { $attribute: value; $attribute:value; ...}
:
(foo bar) => { $weight: 2.0; $slop: 1; $inorder: true; }
~(bar baz) => { $weight: 0.5; }
The supported attributes are:
- $weight: determines the weight of the sub-query or token in the overall ranking on the result (default: 1.0).
- $slop: determines the maximum allowed slop (space between terms) in the query clause (default: 0).
- $inorder: whether or not the terms in a query clause must appear in the same order as in the query. This is usually set alongside with
$slop
(default: false).
- $phonetic: whether or not to perform phonetic matching (default: true). Note: setting this attribute to true for fields which were not created as
PHONETIC
will produce an error.
As of v2.6.1, the query attributes syntax supports these additional attributes:
- $yield_distance_as: specifies the distance field name, used for later sorting and/or returning, for clauses that yield some distance metric. It is currently supported for vector queries only (both KNN and range).
- vector query params: pass optional parameters for vector queries in key-value format.
A few query examples
-
Simple phrase query - hello
AND world
:
hello world
-
Exact phrase query - hello
FOLLOWED BY world
:
"hello world"
-
Union - documents containing either hello
OR world
:
hello|world
-
Not - documents containing hello
BUT NOT world
:
hello -world
-
Intersection of unions:
(hello|halo) (world|werld)
-
Negation of union:
hello -(world|werld)
-
Union inside phrase:
(barack|barrack) obama
-
Optional terms with higher priority to ones containing more matches:
obama ~barack ~michelle
-
Exact phrase in one field, one word in another field:
@title:"barack obama" @job:president
-
Combined AND, OR with field specifiers:
@title:"hello world" @body:(foo bar) @category:(articles|biographies)
-
Prefix/infix/suffix queries:
hello worl*
hel* *worl
hello -*worl*
-
Wildcard matching queries:
"w'foo??bar??baz'"
"w'???????'"
"w'hello*world'"
-
Numeric filtering - products named tv
with a price range of 200 to 500:
@name:tv @price:[200 500]
-
Numeric filtering - users with age greater than 18:
@age:[(18 +inf]
Mapping common SQL predicates to Redis Query Engine
SQL Condition
Redis Query Engine Equivalent
Comments
WHERE x='foo' AND y='bar'
@x:foo @y:bar
for less ambiguity use (@x:foo) (@y:bar)
WHERE x='foo' AND y!='bar'
@x:foo -@y:bar
WHERE x='foo' OR y='bar'
(@x:foo)|(@y:bar)
WHERE x IN ('foo', 'bar','hello world')
@x:(foo|bar|"hello world")
quotes mean exact phrase
WHERE y='foo' AND x NOT IN ('foo','bar')
@y:foo (-@x:foo) (-@x:bar)
WHERE x NOT IN ('foo','bar')
-@x:(foo|bar)
WHERE num BETWEEN 10 AND 20
@num:[10 20]
WHERE num >= 10
@num:[10 +inf]
WHERE num > 10
@num:[(10 +inf]
WHERE num < 10
@num:[-inf (10]
WHERE num <= 10
@num:[-inf 10]
WHERE num < 10 OR num > 20
@num:[-inf (10] | @num:[(20 +inf]
WHERE name LIKE 'john%'
@name:john*
Technical notes
The query parser is built using the Lemon Parser Generator and a Ragel based lexer. You can see the DIALECT 2
grammar definition at this git repo.
You can also see the DEFAULT_DIALECT configuration parameter.
On this page