Splainer Search

Splainer Search is a plain-JavaScript (ESM) search library that supports Solr, OpenSearch and Elasticsearch and is focused on relevance diagnostics with some experimental support for other search engines, including Algolia and Vectara.

It's used in the relevancy tuning tools Quepid and Splainer. It is available for anyone to use (see license).

Note: From v3.0.0 onward, splainer-search is a plain ESM library with no AngularJS dependency. The npm badge above shows the latest published version; if you are still on 2.x, see CHANGELOG.md for migration notes — most notably, angular.forEach examples below have become plain for…of loops. For Splainer and Quepid, prefer the wired entry (below).

Splainer, Quepid, and other full apps

Use the splainer-search/wired.js (or splainer-search/wired) entry so you get the same service graph the library tests use (fieldSpecSvc, searchSvc, createSearcher, ...) without copying wiring into your repo:

import { createWiredServices, createFetchClient } from 'splainer-search/wired.js';

var httpClient = createFetchClient({
  credentials: 'include', // optionally: send cookies on cross-origin requests (Solr/search API must allow CORS)
  // fetch: globalThis.fetch, // optionally: custom fetch for CSRF headers, tracing, or tests
});
var api = createWiredServices(httpClient);
var fieldSpecSvc = api.fieldSpecSvc;
var searchSvc = api.searchSvc;

var fields = fieldSpecSvc.createFieldSpec('id title hl:body author');
var searcher = searchSvc.createSearcher(
  fields,
  'http://localhost:8983/solr/select',
  { q: ['*:*'] }
);

<script> tags (no bundler): run npm run build in this repo so dist/ exists, or use the same files from a published install under node_modules/splainer-search/dist/. URI.js must load first (the IIFE expects globalThis.URI). Then load either the wired bundle or the barrel bundle:

Script	Global	Use when
`dist/splainer-search-wired.js`	`SplainerSearchWired`	You want `createWiredServices` / `createFetchClient` like the ESM example above (Splainer / Quepid style).
`dist/splainer-search.js`	`SplainerSearch`	You import named constructors only and wire dependencies yourself.

Example (wired bundle — adjust script src paths to where you serve URI.js and the vendored splainer-search-wired.js file):

<script src="https://cdn.jsdelivr.net/npm/urijs@1.19.11/src/URI.min.js"></script>
<script src="./node_modules/splainer-search/dist/splainer-search-wired.js"></script>
<script>
  var httpClient = SplainerSearchWired.createFetchClient({
    credentials: 'include', // optional: cross-origin cookies (needs CORS)
  });
  var api = SplainerSearchWired.createWiredServices(httpClient);
  var fields = api.fieldSpecSvc.createFieldSpec('id title hl:body author');
  var searcher = api.searchSvc.createSearcher(
    fields,
    'http://localhost:8983/solr/select',
    { q: ['*:*'] }
  );
</script>

Splainer Search utilizes a JSONP wrapper for communication with Solr. Elasticsearch, OpenSearch, Algolia, and Vectara use simple HTTP and JSON via CORS. For Algolia, Vectara, and Elasticsearch/OpenSearch, you can pass extra HTTP headers in config.customHeaders as a JSON string (see the examples below); the library merges that into outbound requests. Solr can use the same field when using GET/POST transports that accept headers.

All fields are explained and highlighted when the backend supports it and you request it. A friendly interface is provided to specify the arguments in terms of a JavaScript object. See below for basic examples.

Basic usage

The first argument to searchSvc.createSearcher must be a field spec object from fieldSpecSvc.createFieldSpec(string) (for example 'id title hl:body' or 'id:_id title'). It is not a plain JavaScript array of field names. The snippets below assume searchSvc and fieldSpecSvc come from createWiredServices (as in the example above).

Solr

Splainer-search will perform the specified search against Solr attempting to highlight and extract explain info. To request highlighting on a specific field, prefix the fieldname with "hl:" i.e: hl:overview.

// searcher that searches id, title, body, author (hl: marks highlight fields)
var fields = fieldSpecSvc.createFieldSpec('id title hl:body author');
var searcher = searchSvc.createSearcher(
  fields,
  'http://localhost:8983/solr/select',
  {
    'q': ['*:*'],
    'fq': ['title:Moby*', 'author:Herman']
  }
);

searcher.search()
.then(function() {
  for (const doc of searcher.docs) {
    console.log(doc.source().title);
    // highlights. You need to pass id as that's how Solr
    // organizes the explain. See below for a friendlier/higher-level
    // interface with normalDocs
    console.log(doc.highlight(doc.source().id, 'title', '<b>', '</b>'));
    // explain info
    console.log(doc.explain(doc.source().id));
  }
});

Elasticsearch and OpenSearch

Splainer-search supports these search engines using the same client code path and the query DSL shape Elasticsearch expects. Pass 'es' or 'os' as the sixth argument to createSearcher so the library does not default to Solr. OpenSearch uses the same implementation as Elasticsearch; 'os' is mainly for correct labeling and for tooling (for example URL validation) that cares which engine you use.

The fourth argument is the query string used to replace #$query## (and related template placeholders) in the DSL; the fifth is optional config (see Specifying search engine version number).

var fields = fieldSpecSvc.createFieldSpec('id:_id title body author');
var searcher = searchSvc.createSearcher(
  fields,
  'http://localhost:9200/books/_search',
  {
    'query': {
      'match': {
        'title': '#$query##'
      }
    }
  },
  'moby dick',
  {},
  'es' // use 'os' for OpenSearch
);

Algolia

Splainer-search has experimental support for Algolia search. Use an index query URL (…/1/indexes/{index}/query), pass the sixth argument as 'algolia', and supply Algolia headers on config.customHeaders (a JSON string). The query string placeholder #$query## in the request body is replaced with the queryText argument, same as for other engines.

var fields = fieldSpecSvc.createFieldSpec('id:objectID title description');
var searcher = searchSvc.createSearcher(
  fields,
  'https://YOUR_APP_ID-dsn.algolia.net/1/indexes/products/query',
  {
    query: '#$query##',
    hitsPerPage: 10,
    page: 0,
    attributesToRetrieve: ['objectID', 'title', 'description'],
  },
  'laptop',
  {
    apiMethod: 'POST',
    customHeaders: JSON.stringify({
      'X-Algolia-Application-Id': 'YOUR_APP_ID',
      'X-Algolia-API-Key': 'YOUR_SEARCH_API_KEY',
    }),
  },
  'algolia'
);

Use apiMethod: 'GET' or 'POST' depending on how you want the query sent; the tests cover both. Explain and some other advanced Splainer-search features are not available for Algolia in the same way as for Solr or Elasticsearch.

Vectara

Splainer-search has experimental support for Vectara. You can send queries in the Vectara format but must also pass in the authorization headers as custom headers, e.g.

var fields = fieldSpecSvc.createFieldSpec('id:_id title body author');
var searcher = searchSvc.createSearcher(
  fields,
  'https://api.vectara.io:443/v1/query',
  {
    "query": [
      {
        "query": "#$query##",
        "numResults": 10,
        "corpusKey": [
          {
            "customerId": 123456789,
            "corpusId": 1
          }
        ]
      }
    ]
  },
  'your query text',
  {
    customHeaders: JSON.stringify({
      'customer-id': '123456789',
      'x-api-key': 'api_key',
    }),
  },
  'vectara'
);

Please note that the Vectara integration currently does not support explain or other advanced Splainer-search functionality.

Custom Search API

Splainer-search has experimental support for Custom APIs. You can send in queries as GET or POST and your API must respond with a JSON formatted response.

The magic of the Custom Search API is that you provide some mapping JavaScript code to convert from the JSON format of your API to the native structures that splainer-search uses. Imagine your response looks like:

[
    {
        "publication_id": "12345678",
        "publish_date_int": "20230601",
        "score": 0.5590707659721375,
        "title": "INFOGRAPHIC: Automakers' transition to EVs speeds up"
    },
    {
        "publication_id": "1234567",
        "publish_date_int": "20230608",
        "score": 0.5500463247299194,
        "title": "Tesla - March 2023 (LTM): Peer Snapshot"
    }
];

Then you would define two custom mappers, where data is your JSON:

var options = { apiMethod: 'GET' };
options.numberOfResultsMapper = function(data){
  return data.length;
}
options.docsMapper = function(data){    
  let docs = [];
  for (let doc of data) {
    docs.push ({
      id: doc.publication_id,
      publish_date_int: doc.publish_date_int,
      title: doc.title,
    })
  }
  return docs
}

Pass options as the fifth argument (searcher config), and use an args object whose values can carry the #$query## placeholder (here the key is query). The fourth argument is the query text applied to that template.

var fields = fieldSpecSvc.createFieldSpec('id:publication_id title publish_date_int');
var args = { query: '#$query##' };
var searcher = searchSvc.createSearcher(
  fields,
  'http://mycompany.com/api/search',
  args,
  'tesla',
  options,
  'searchapi'
);

Paging

Paging is done by asking the original searcher for another searcher. This searcher is already setup to get the next page for the current search results. Tell that searcher to search() just like you did above.

var results = [];
searcher.search()
.then(function() {
  for (const doc of searcher.docs) {
    results.push(doc.source().title);
  }
  // once results returned, get a new searcher for the next
  // page of results, just rerun the search later exactly as
  // its run here
  searcher = searcher.pager();
});

// sometime later we page...
searcher.search()
.then(function() {

});

Explain Other

Let's say you have performed a search for tacos and you get a bunch of results, but the chef comes back to you and says:

Hey! My new creation "La Bomba" is not showing up, fix it!!!!

So you are puzzled as to why it is not showing up, since it's clearly marked as a taco in the db. Wouldn't it be nice if splainer-search gave you some help?

Don't worry, we've got your back :)

Solr

So assuming you already have something like this:

var fields = fieldSpecSvc.createFieldSpec('id title price');
var options = {
  url:          'http://localhost:8983/solr/select',
  args:         { 'q': ['#$query##'] },
  query:        'tacos',
  config:       {},
  searchEngine: 'solr'
};
var searcher = searchSvc.createSearcher(fields, options.url, options.args, options.query, options.config, options.searchEngine);

searcher.search();

You would want to create a new searcher with the same options/context, and use the explainOther() function:

var explainSearcher = searchSvc.createSearcher(fields, options.url, options.args, options.query, options.config, options.searchEngine); // same options as above

// assuming that we know "El Bomba" has id 63148
explainSearcher.explainOther('id:63148', fields);

The explainOther() function returns the same promise as the search() function so you can retrieve the results in the same way.

Elasticsearch / OpenSearch

In ES/OS, the explainOther() function behaves the same way, except that it does not need a fieldSpec param to be passed in.

var fields = fieldSpecSvc.createFieldSpec('id title price');
var options = {
  url:          'http://localhost:9200/tacos/_search',
  args:         {
    'query': {
      'match': {
        'title': '#$query##'
      }
    }
  },
  query:        'tacos',
  config:       {},
  searchEngine: 'es' // or 'os' for OpenSearch
};
var searcher = searchSvc.createSearcher(fields, options.url, options.args, options.query, options.config, options.searchEngine);

searcher.search();

var explainSearcher = searchSvc.createSearcher(fields, options.url, options.args, options.query, options.config, options.searchEngine); // same options as above

// assuming that we know "El Bomba" has id 63148
explainSearcher.explainOther('id:63148');

The explainOther() function returns the same promise as the search() function so you can retrieve the results in the same way.

Normalizing docs with normalDocs/fieldSpec

This library was originally written for dealing with debug tools such as Quepid and Splainer. As such, it provides a lot of help taking a user specified list of fields and associated roles, then once search is done turning the raw docs out of the Solr searcher into something more normalized based on that config (a normalDoc).

The normalDoc provides a friendlier, more standard interface. This includes friendlier parsing of explain information as needed.

var userFieldSpec = "id:uuid, title, body, authors"
var fs = fieldSpecSvc.createFieldSpec(userFieldSpec)
var searcher = searchSvc.createSearcher(
  fs,
  'http://localhost:8983/solr/select',
  {
    'q': ['*:*'],
    'fq': ['title:Moby*', 'authors:Herman']
  }
);

searcher.search()
.then(function() {
  var bestScore = 0;
  for (const doc of searcher.docs) {
    var normalDoc = normalDocsSvc.createNormalDoc(fs, doc);
    // access unique id and title
    // (above specified to be uuid and title)
    console.log("ID is:" + normalDoc.id);
    console.log("Title is:" + normalDoc.title);

    // snippets -- best try to highlight the field
    for (const [fieldName, snippet] of Object.entries(normalDoc.subSnippets)) {
      console.log('hopefully this is a highlight! ' + snippet);
    }

    // prettier and heavily sanitized explain info:
    // (the explain modal on Splainer shows this)
    console.log(normalDoc.explain());

    // hot matches contains the most important matches
    // this drives the horizontal graph bars in Quepid/Splainer
    var matches = normalDoc.hotMatches();

    // Give hotMatchesOutOf a maximum score (for all docs returned) and you'll
    // get the hot matches as a percentage of the whole
    if (normalDoc.score() > bestScore) {
      bestScore = normalDoc.score();
    }
    normalDoc.hotMatchesOutOf(bestScore);

    // a link to the document in Solr is handy:
    console.log(normalDoc._url())
  }
});

Specifying search engine version number

Most of what splainer-search does should be compatible with all versions of Solr and Elasticsearch. There are times though where one of these projects introducing a breaking change and it becomes necessary to specify the version number used.

For example, ES deprecated the fields parameter in favor of stored_fields. So it's necessary to tell splainer-search which version you are using in order to send the appropriate request.

To do so you only need to specify the version number in the config param when constructing a new searcher:

Elasticsearch

var fields = fieldSpecSvc.createFieldSpec('id title price');
var options = {
  url:          'http://localhost:9200/tacos/_search',
  args:         {
    'query': {
      'match': {
        'title': '#$query##'
      }
    }
  },
  query:        'tacos',
  config:       { version: 5.1 },
  searchEngine: 'es' // or 'os' for OpenSearch
};
var searcher = searchSvc.createSearcher(fields, options.url, options.args, options.query, options.config, options.searchEngine);

searcher.search();

And splainer-search will take care of using the correct name in the parameters.

NB: The default ES config uses version 5.0 (see values/defaultESConfig.js), so 5.x-style behavior applies unless you override config.version (string or number is accepted).

Highlighting of results

If the individual search result field is a string then it is automatically highlighted.

However, if the selected value is an array or a JSON object, it doesn't coerce it to a string (and as a result doesn't highlight it, either).

Secondly, if any component in the selected path results in an array, the rest of the path is spread over the array value. To explain:

Data: { "variants": [ { "name": "red" }, { "name": "blue" } ] }
Path (or _field name_): "variants.name"
Result: [ "red", "blue" ]

Understanding query parameters

Sometimes we want to understand what queries are being sent to the search engine, and it can be a bit opaque if we are going through an API or if we have parameters being appended inside the search engine (think Paramsets in Solr or templates in ES). The property used for that is engine-specific (Solr uses searcher.queryDetails; see below).

Solr

Solr exposes effective request parameters on the response as responseHeader.params. Splainer-search copies that object onto searcher.queryDetails.

Send echoParams=all (or similar) to Solr so responseHeader.params is populated.

Elasticsearch/OpenSearch

There is no direct equivalent to Solr’s echoParams / responseHeader.params. For how the executed query was processed, use the profile response (see the next section): the library stores that under the same searcher.parsedQueryDetails property when profiling is present.

Understanding query input parsing

Frequently we want to understand what the search engine is doing to the raw query input. Consult the searcher.parsedQueryDetails property to get a search engine specific JSON data structure.

Solr

For Solr we copy keys from the debug section of the response into searcher.parsedQueryDetails, except track, timing, explain, and explainOther. Querqy-related fields from the top-level response may also be merged in (see Querqy below).

Elasticsearch/OpenSearch

In ES/OS we default to profile=true, and map everything nested under the profile key in the response to the searcher.parsedQueryDetails property.

Querqy Rules Library Support

Querqy is a query rewriting library. It helps you to tune your search results for specific search terms. Understanding what Querqy is doing to your queries is critical to achieving great search results.

Solr

The searcher.parsedQueryDetails property surfaces all the debugging information about what rewriting Querqy is doing to the input query. Assuming you are also requesting the details on what rules are being matched via the querqy.infoLogging=on query parameter, then you will also see that information in the searcher.parsedQueryDetails structure.

Developing this repo

Contributors need Node.js 20.12+ (Vitest 4 / tooling). See DEVELOPER_GUIDE.md for install, npm test, lint, build, and release notes.

Thanks to...

Development for this library is done primarily by OpenSource Connections for search relevance tools Splainer and Quepid

Original author is Doug Turnbull

Name		Name	Last commit message	Last commit date
Latest commit History 656 Commits
.circleci		.circleci
.cursor/rules		.cursor/rules
factories		factories
services		services
shims		shims
test		test
values		values
wired		wired
.eslintrc.cjs		.eslintrc.cjs
.gitignore		.gitignore
.nvmrc		.nvmrc
.prettierignore		.prettierignore
.prettierrc.json		.prettierrc.json
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
DEVELOPER_GUIDE.md		DEVELOPER_GUIDE.md
FUTURE.md		FUTURE.md
LICENSE.txt		LICENSE.txt
MIGRATION_CHANGES.md		MIGRATION_CHANGES.md
README.md		README.md
RELEASE_NOTES_3.0.0_DRAFT.md		RELEASE_NOTES_3.0.0_DRAFT.md
build.js		build.js
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json
stryker.conf.json		stryker.conf.json
vitest.config.js		vitest.config.js
wired.js		wired.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Splainer Search

Splainer, Quepid, and other full apps

Basic usage

Solr

Elasticsearch and OpenSearch

Algolia

Vectara

Custom Search API

Paging

Explain Other

Solr

Elasticsearch / OpenSearch

Normalizing docs with normalDocs/fieldSpec

Specifying search engine version number

Elasticsearch

Highlighting of results

Understanding query parameters

Solr

Elasticsearch/OpenSearch

Understanding query input parsing

Solr

Elasticsearch/OpenSearch

Querqy Rules Library Support

Solr

Developing this repo

Thanks to...

About

Uh oh!

Releases 62

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Splainer Search

Splainer, Quepid, and other full apps

Basic usage

Solr

Elasticsearch and OpenSearch

Algolia

Vectara

Custom Search API

Paging

Explain Other

Solr

Elasticsearch / OpenSearch

Normalizing docs with normalDocs/fieldSpec

Specifying search engine version number

Elasticsearch

Highlighting of results

Understanding query parameters

Solr

Elasticsearch/OpenSearch

Understanding query input parsing

Solr

Elasticsearch/OpenSearch

Querqy Rules Library Support

Solr

Developing this repo

Thanks to...

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 62

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages