com.apple.foundationdb.record.lucene (fdb-record-layer-lucene 4.5.5.0 API)

package com.apple.foundationdb.record.lucene

Support for LUCENE indexes and queries.

Lucene indexes are backed by FDB, using FDBDirectory to implement a virtual file system holding the inverted index files. This is not fundamental, though. The maintainer uses the standard IndexWriter and IndexReader, obtained with FDBDirectoryManager.getIndexWriter(com.apple.foundationdb.tuple.Tuple, java.lang.Integer), to interface with Lucene.

The index definition can be grouped. Each group represents an entirely separate Lucene index.
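The isolation between groups can be pictured with a toy model. The sketch below is not the Record Layer implementation: the class GroupedIndexSketch, its string group keys, and its whitespace tokenizer are all hypothetical, chosen only to show that a search in one group never sees another group's documents.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Toy model (hypothetical): one independent inverted index per grouping key. */
final class GroupedIndexSketch {
    // group key -> (term -> ids of records in that group containing the term)
    private final Map<String, Map<String, Set<Long>>> indexesByGroup = new HashMap<>();

    /** Adds a record's text to the inverted index owned by its group. */
    void add(String group, long recordId, String text) {
        Map<String, Set<Long>> index = indexesByGroup.computeIfAbsent(group, g -> new HashMap<>());
        // Assumption: lower-cased whitespace tokenization stands in for a real Analyzer.
        for (String token : text.toLowerCase(Locale.ROOT).split("\\s+")) {
            if (!token.isEmpty()) {
                index.computeIfAbsent(token, t -> new TreeSet<>()).add(recordId);
            }
        }
    }

    /** Searches only the named group's index; other groups are never consulted. */
    Set<Long> search(String group, String term) {
        Map<String, Set<Long>> index = indexesByGroup.getOrDefault(group, Collections.emptyMap());
        return index.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptySet());
    }

    public static void main(String[] args) {
        GroupedIndexSketch sketch = new GroupedIndexSketch();
        sketch.add("alice", 1L, "Hello world");
        sketch.add("bob", 2L, "Hello there");
        System.out.println(sketch.search("alice", "hello")); // only record 1 is visible
        System.out.println(sketch.search("bob", "world"));   // empty: "world" is in alice's group
    }
}
```

In the real index the grouping key comes from the index's key expression rather than a plain string; the point here is only that each group owns its own term dictionary.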

Within a group, each record is represented by a single Lucene document. Fields to be included in the document are given by a concat expression. Unlike most indexes, the order of fields here does not matter for what queries are possible. Repeated record fields turn into multiple document fields. Fields in nested submessages, possibly repeated, are flattened into document fields with longer field names, representing the path through the record.
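As an illustration of the flattening described above, the following sketch walks a nested record (modeled here as plain Java maps and lists, not protobuf messages) and emits one document field per leaf value. The class FlattenSketch and the underscore separator are assumptions made for the example; the actual naming scheme is defined by the index expressions and may differ.

```java
import java.util.AbstractMap.SimpleImmutableEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Toy model (hypothetical) of flattening a nested record into document fields. */
final class FlattenSketch {
    // Assumption: path segments are joined with '_'.
    private static final String SEPARATOR = "_";

    /** Returns (field name, value) pairs in record order. */
    static List<Map.Entry<String, Object>> flatten(Map<String, Object> record) {
        List<Map.Entry<String, Object>> out = new ArrayList<>();
        flattenInto("", record, out);
        return out;
    }

    @SuppressWarnings("unchecked")
    private static void flattenInto(String path, Object value, List<Map.Entry<String, Object>> out) {
        if (value instanceof Map) {
            // Nested submessage: extend the path with each child field name.
            for (Map.Entry<String, Object> child : ((Map<String, Object>) value).entrySet()) {
                String name = path.isEmpty() ? child.getKey() : path + SEPARATOR + child.getKey();
                flattenInto(name, child.getValue(), out);
            }
        } else if (value instanceof List) {
            // Repeated field: one document field per element, all sharing the same name.
            for (Object element : (List<Object>) value) {
                flattenInto(path, element, out);
            }
        } else {
            out.add(new SimpleImmutableEntry<>(path, value));
        }
    }

    public static void main(String[] args) {
        Map<String, Object> author = new LinkedHashMap<>();
        author.put("name", "Ada");
        author.put("emails", Arrays.asList("a@x.org", "b@x.org"));
        Map<String, Object> rec = new LinkedHashMap<>();
        rec.put("title", "Notes");
        rec.put("author", author);
        for (Map.Entry<String, Object> field : flatten(rec)) {
            System.out.println(field.getKey() + " = " + field.getValue());
        }
    }
}
```

Note that the repeated emails field yields two document fields with the same name, which matches the "multiple document fields" behavior described above.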

Basic support for correlation is provided by allowing a nested field's string value to contribute to the document field name. This is well suited to map-like fields where the keys are from a small, known set.
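To see why folding the key into the field name gives correlation, consider a map-like repeated field whose entries each carry a key and a value. The sketch below is a hypothetical stand-alone model (MapFieldNamingSketch, MapEntry, and the underscore separator are not library names): each entry's key becomes part of the document field name, so a search on one field name can only match values that were stored under that key.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Toy model (hypothetical) of letting a nested key contribute to the field name. */
final class MapFieldNamingSketch {
    /** One entry of a map-like repeated field, e.g. {key: "color", value: "red"}. */
    static final class MapEntry {
        final String key;
        final String value;

        MapEntry(String key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    /** Builds document field name -> values; assumes '_' joins parent name and key. */
    static Map<String, List<String>> toDocumentFields(String parentField, List<MapEntry> entries) {
        Map<String, List<String>> fields = new LinkedHashMap<>();
        for (MapEntry entry : entries) {
            String name = parentField + "_" + entry.key;
            fields.computeIfAbsent(name, n -> new ArrayList<>()).add(entry.value);
        }
        return fields;
    }

    public static void main(String[] args) {
        List<MapEntry> attributes = Arrays.asList(
                new MapEntry("color", "red"),
                new MapEntry("size", "large"),
                new MapEntry("color", "blue"));
        // Values stay tied to their keys: "large" can never match a search on attributes_color.
        System.out.println(toDocumentFields("attributes", attributes));
    }
}
```

Because every distinct key creates a distinct document field, this approach fits the small, known key sets mentioned above; an unbounded key space would produce an unbounded number of field names.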

Function key expressions designate which fields receive full-text tokenization, which are stored in the Lucene document, and which refine document field naming.

The standard form of a Lucene index scan is a Lucene search query. A special LucenePlanner is able to synthesize these from regular query expressions and Lucene search syntax.

Related Packages

Package

Description

com.apple.foundationdb.record.lucene.codec

Common classes for optimization of lucene's codec.

com.apple.foundationdb.record.lucene.directory

Common classes for lucene index queries.

com.apple.foundationdb.record.lucene.exact

Contains classes for analyzing stored and sorted fields.

com.apple.foundationdb.record.lucene.filter

Contains filter classes.

com.apple.foundationdb.record.lucene.highlight

Highlighting of matched terms found using the Record Layer Lucene integration.

com.apple.foundationdb.record.lucene.idformat

Common classes for lucene's ngram tokenizing.

com.apple.foundationdb.record.lucene.ngram

Common classes for lucene's ngram tokenizing.

com.apple.foundationdb.record.lucene.query

Common classes for handling bitsets.

com.apple.foundationdb.record.lucene.search

Common classes for parallel execution of search.

com.apple.foundationdb.record.lucene.synonym

Common classes for lucene's synonym tokenizing.

Class

Description

AlphanumericCjkAnalyzer

A CJK Analyzer which applies a minimum and maximum token length to non-CJK tokens.

AlphanumericLengthFilterFactory

A TokenFilterFactory that creates Alphanumeric Length filters.

AnalyzerChooser

Choose an Analyzer.

AutoCompleteAnalyzer

An analyzer that is used to analyze the auto_complete input.

EmailCjkSynonymAnalyzer

An analyzer that can handle emails, CJK, and synonyms.

EmailCjkSynonymAnalyzerFactory

Factory to build index and query Analyzer for EmailCjkSynonymAnalyzer.

LuceneAnalyzerCombinationProvider

Provide a combination of analyzers for multiple fields of one Lucene index.

LuceneAnalyzerFactory

Each implementation of Analyzer should have its own implementation of this factory interface to provide instances of the analyzers for indexing and query to a LuceneAnalyzerRegistry.

LuceneAnalyzerRegistry

Registry for AnalyzerChoosers.

LuceneAnalyzerRegistryImpl

Default implementation of the LuceneAnalyzerRegistry.

LuceneAnalyzerType

The type used to determine how the Analyzer built by LuceneAnalyzerFactory is used.

LuceneAnalyzerWrapper

A wrapper for Analyzer and its unique identifier.

LuceneAutoCompleteAnalyzerFactory

Factory to build index and query Analyzer for auto-complete suggestions.

LuceneAutoCompleteHelpers

This class provides some helpers for auto-complete functionality using Lucene auto complete suggestion lookup.

LuceneAutoCompleteHelpers.AutoCompleteTokens

Helper class to capture token information synthesized from a search key.

LuceneAutoCompleteQueryClause

Auto complete query clause from string using Lucene search syntax.

LuceneBooleanQuery

Binder for a conjunction of other clauses.

LuceneComparisonQuery

Wrapper of a Lucene Query that contains accessible field name, comparison type, and comparand.

LuceneConcurrency

Utility class for methods related to synchronizing Futures.

LuceneConcurrency.AsyncToSyncTimeoutException

An exception that is thrown when the async to sync operation times out.

LuceneDocumentFromRecord

Helper class for converting FDBRecords to Lucene documents.

LuceneDocumentFromRecord.DocumentField

LuceneDocumentFromRecord.DocumentFieldList<T extends LuceneIndexExpressions.RecordSource<T>>

LuceneDocumentFromRecord.FDBRecordSource<M extends Message>

A RecordSource based on an FDBRecord.

LuceneEvents

StoreTimer events associated with Lucene operations.

LuceneEvents.Counts

Count events.

LuceneEvents.DetailEvents

Detail events.

LuceneEvents.Events

Main events.

LuceneEvents.SizeEvents

Size events.

LuceneEvents.Waits

Wait events.

LuceneExceptions

Utility class for converting Lucene Exceptions to/from Record layer ones.

LuceneExceptions.LuceneTransactionTooOldException

A Wrapper around the transaction-too-old exception that gets thrown through Lucene as an IOException.

LuceneFunctionKeyExpression

Lucene function key expressions.

LuceneFunctionKeyExpression.LuceneFieldConfig

The key function for Lucene field configuration.

LuceneFunctionKeyExpression.LuceneFieldName

The lucene_field_name key function.

LuceneFunctionKeyExpression.LuceneSortBy

Key function representing one of the Lucene built-in sorting techniques.

LuceneFunctionKeyExpression.LuceneSorted

The lucene_sorted key function.

LuceneFunctionKeyExpression.LuceneStored

The lucene_stored key function.

LuceneFunctionKeyExpression.LuceneText

The lucene_text key function.

LuceneFunctionKeyExpressionFactory

Implementation of Lucene index key functions.

LuceneFunctionNames

Key function names for Lucene indexes.

LuceneFunctionNames.LuceneFieldIndexOptions

Option keys for LuceneFunctionNames.LUCENE_FULL_TEXT_FIELD_INDEX_OPTIONS.

LuceneGetMetadataInfo

Get metadata information about a given lucene index.

LuceneIndexExpressions

The root expression of a LUCENE index specifies how select fields of a record are mapped to fields of a Lucene document.

LuceneIndexExpressions.DocumentDestination<T extends LuceneIndexExpressions.RecordSource<T>>

An actual document / document meta-data.

LuceneIndexExpressions.DocumentFieldDerivation

Information about how a document field is derived from a record field.

LuceneIndexExpressions.DocumentFieldType

Possible types for document fields.

LuceneIndexExpressions.RecordSource<T extends LuceneIndexExpressions.RecordSource<T>>

An actual record / record meta-data.

LuceneIndexKeyValueToPartialRecordUtils

A utility class to build a partial record for an auto-complete suggestion value, with grouping keys if they exist.

LuceneIndexKeyValueToPartialRecordUtils.LuceneSpellCheckCopier

The copier that populates the Lucene auto-complete suggestion as a value for the field from which it was indexed.

LuceneIndexKeyValueToPartialRecordUtils.LuceneSpellCheckCopier.Deserializer

Deserializer.

LuceneIndexMaintainer

Index maintainer for Lucene Indexes backed by FDB.

LuceneIndexMaintainerFactory

Index Maintainer Factory for Lucene Indexes.

LuceneIndexOptions

Options for use with Lucene indexes.

LuceneIndexQueryPlan

Lucene query plan for including search-related scan parameters.

LuceneIndexQueryPlan.Deserializer

Deserializer.

LuceneIndexScrubbingToolsMissing

Index Scrubbing Toolbox for a Lucene index maintainer.

LuceneIndexScrubbingToolsMissing.MissingIndexReason

Provides a Lucene-specific reason for detecting a "missing" index entry.

LuceneIndexSpellCheckQueryPlan

Lucene query plan for making spell-check suggestions.

LuceneIndexTypes

An index on the tokens in a text field.

LuceneIndexValidator

Validator for Lucene indexes.

LuceneLoggerInfoStream

Record Layer's implementation of InfoStream that publishes messages as TRACE logs.

LuceneLogMessageKeys

Lucene specific logging keys.

LuceneMetadataInfo

Metadata information about a lucene index, in response to LuceneGetMetadataInfo.

LuceneMetadataInfo.LuceneInfo

Information about an individual Lucene directory.

LuceneNotQuery

Binder for a negation of clauses.

LucenePartitioner

Manage partitioning info for a logical, partitioned lucene index, in which each partition is a separate physical lucene index.

LucenePartitioner.RepartitioningLogMessages

Encapsulates and manages additional log messages when repartitioning.

LucenePlanner

A planner to implement lucene query planning so that we can isolate the lucene functionality to a distinct package.

LucenePrimaryKeySegmentIndex

Maintain a B-tree index of primary key to segment and doc id.

LucenePrimaryKeySegmentIndex.DocumentIndexEntry

Result of LucenePrimaryKeySegmentIndex.findDocument(org.apache.lucene.index.DirectoryReader, com.apple.foundationdb.tuple.Tuple).

LucenePrimaryKeySegmentIndexV1

Maintain a B-tree index of primary key to segment and doc id.

LucenePrimaryKeySegmentIndexV1.StoredFieldsReaderSegmentInfo

Hook for getting back segment info during merge.

LucenePrimaryKeySegmentIndexV2

Maintain a B-tree index of primary key to segment and doc id.

LuceneQueryClause

Binder for a single query clause.

LuceneQueryClause.BoundQuery

Helper class to capture a bound query.

LuceneQueryComponent

A Query Component for Lucene that wraps the query supplied.

LuceneQueryFieldComparisonClause

Query clause using a Comparisons.Comparison against a document field.

LuceneQueryMultiFieldSearchClause

Query clause from string using Lucene search syntax.

LuceneQuerySearchClause

Query clause from string using Lucene search syntax.

LuceneQueryType

The type of component.

LuceneRecordContextProperties

The list of RecordLayerPropertyKeys for configuring Lucene indexing for an FDBRecordContext.

LuceneRecordCursor

This class is a Record Cursor implementation for Lucene queries.

LuceneRecordCursor.ScoreDocIndexEntry

An IndexEntry based off a Lucene ScoreDoc.

LuceneRepartitionPlanner

Manage repartitioning details (merging small partitions and splitting large ones).

LuceneRepartitionPlanner.RepartitioningContext

Convenience collection of data needed for repartitioning.

LuceneScanBounds

Base class for IndexScanBounds used by LUCENE indexes.

LuceneScanParameters

Base class for IndexScanParameters used by LUCENE indexes.

LuceneScanQuery

Scan a LUCENE index using a Lucene Query.

LuceneScanQueryParameters

Scan parameters for making a LuceneScanQuery.

LuceneScanQueryParameters.Deserializer

Deserializer.

LuceneScanQueryParameters.LuceneQueryHighlightParameters

The parameters for highlighting matching terms of a Lucene search.

LuceneScanSpellCheck

Scan a LUCENE index for auto-complete suggestions.

LuceneScanSpellCheckParameters

Scan parameters for making a LuceneScanSpellCheck.

LuceneScanSpellCheckParameters.Deserializer

Deserializer.

LuceneScanTypes

IndexScanTypes for Lucene.

LuceneSpellCheckRecordCursor

Cursor over Lucene spell-check query results.

RegistrySynonymGraphFilterFactory

A SynonymGraphFilterFactory which uses an underlying Registry to statically cache synonym mappings, which is _significantly_ more efficient when using lots of distinct analyzers (such as during highlighting, or with lots of parallel record stores).
