tf idf - Solr- Find "Significant Terms" on Subset of Documents -


i'm trying "significant terms" subset of documents in solr. may or may not best way, i'm attempting use solr's tf-idf functionality since have data stored in solr , it's lightning fast. want restrict "df" count subset of documents, through search or filter. tried this, i'm searching "apple" in name field:

http://localhost:8983/solr/techproducts/tvrh?q=name:apple&tv.tf=true&tv.df=true&tv.tf_idf=true&indent=on&wt=json&rows=1000

and of course, gives me documents have "apple" in name, document frequency gives counts entire dataset, doesn't seem want. think solr can this, maybe not. i'm open suggestions.

thanks, adrian

it 1 works have in backlog[1].

what need document frequency in foreground set ( subset of docs) , document frequency in background set(your corpus). solr won't out of box, can work on it. elastic search has module can inspiration from[2]

[1] https://issues.apache.org/jira/browse/solr-9851

[2] https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-bucket-significantterms-aggregation.html


Comments

Popular posts from this blog

node.js - Node js - Trying to send POST request, but it is not loading javascript content -

javascript - Replicate keyboard event with html button -

javascript - Web audio api 5.1 surround example not working in firefox -