Spreadsheets
Spreadsheets can be indexed by exporting them to .tsv and then using TSVIndexer to turn each row into a document.
TSVIndexer
The TSVIndexer class is responsible for indexing TSV (Tab-Separated Values) files into a catalog.
Example
const tsvIndexer = new TSVIndexer(catalog, "path/to/file.tsv", {
batchSize: 100,
fieldMapping: {
"Original Column": "mappedColumn",
},
});
await tsvIndexer.index();Constructor
constructor(catalog: Catalog, file: string, opts?: TSVIndexerOpts)Creates a new instance of the TSVIndexer.
catalog: The catalog to index the documents into.file: The path to the TSV file to be indexed.opts: Optional configuration options for the indexer.
TSVIndexerOpts
batchSize?: number: The number of documents to process in each batch.getId?: (item: any) => string: A function to extract the document ID from an item.getUrl?: (item: any) => string: A function to extract the URL from an item.getImageUrl?: (item: any) => string: A function to extract the image URL from an item.fieldMapping?: { [key: string]: string }: A mapping of TSV column names to desired field names.
Methods
index()
public async index(): Promise<void>Indexes the TSV file into the catalog. This method reads the TSV file, processes its contents, and adds the documents to the catalog using a JSONIndexer.
Returns: A Promise that resolves when the indexing process is complete.