    1. How to classify documents indexed with Lucene

      Date: 2023-06-28

                Question

                I have indexed a set of documents with Lucene (fields: content, category). Each document has its own category, but some of them are labeled as uncategorized. Is there an easy way to classify these documents in Java?

                Recommended answer

                As of Lucene 5.2.1, you can use indexed documents to classify new documents. Out of the box, Lucene offers a naive Bayes classifier, a k-nearest-neighbor classifier (based on the MoreLikeThis class), and a perceptron-based classifier.

                The drawback is that all of these classes carry experimental warnings, and their documentation consists largely of links to Wikipedia.
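
                To make the idea concrete, here is a minimal sketch of what a naive Bayes classifier does with already-labeled documents. It is plain Java and deliberately does not call Lucene: the class `NaiveBayesSketch`, its `train` and `classify` methods, the crude tokenizer, and the sample texts are all hypothetical names and data invented for illustration. In Lucene itself the term statistics would come from the index and the tokenization from an Analyzer, and the exact classifier constructors differ between versions, so check the Javadoc of the classification module for the release you use.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only: NOT Lucene's classifier, just the idea behind it.
class NaiveBayesSketch {
    // Number of training documents per category.
    private final Map<String, Integer> docsPerCategory = new HashMap<>();
    // Term counts per category: category -> (term -> count).
    private final Map<String, Map<String, Integer>> termCounts = new HashMap<>();
    // Total number of terms seen per category.
    private final Map<String, Integer> totalTerms = new HashMap<>();
    // All distinct terms seen during training.
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    // Crude stand-in for an analyzer: lowercase, split on anything that is not a-z.
    private static String[] tokenize(String text) {
        return text.toLowerCase().split("[^a-z]+");
    }

    // Learn from one document that already has a category.
    void train(String content, String category) {
        totalDocs++;
        docsPerCategory.merge(category, 1, Integer::sum);
        Map<String, Integer> counts =
                termCounts.computeIfAbsent(category, c -> new HashMap<>());
        for (String term : tokenize(content)) {
            if (term.isEmpty()) continue;
            counts.merge(term, 1, Integer::sum);
            totalTerms.merge(category, 1, Integer::sum);
            vocabulary.add(term);
        }
    }

    // Return the most probable category for an uncategorized document,
    // or null if nothing has been trained yet.
    String classify(String content) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String category : docsPerCategory.keySet()) {
            // log P(category)
            double score = Math.log((double) docsPerCategory.get(category) / totalDocs);
            Map<String, Integer> counts = termCounts.get(category);
            int total = totalTerms.getOrDefault(category, 0);
            for (String term : tokenize(content)) {
                if (term.isEmpty()) continue;
                // log P(term | category) with add-one (Laplace) smoothing
                int count = counts.getOrDefault(term, 0);
                score += Math.log((count + 1.0) / (total + vocabulary.size()));
            }
            if (score > bestScore) {
                bestScore = score;
                best = category;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        NaiveBayesSketch nb = new NaiveBayesSketch();
        // Documents that already have a category.
        nb.train("the team won the football match", "sports");
        nb.train("the striker scored a late goal", "sports");
        nb.train("the parliament passed a new tax law", "politics");
        nb.train("the minister announced an election", "politics");
        // A document labeled as uncategorized.
        System.out.println(nb.classify("a goal in the football match")); // prints "sports"
    }
}
```

                The categorized documents act as the training set, and each uncategorized document is then scored against every category. Working in log space avoids numeric underflow on long texts, and the add-one smoothing keeps a single unseen term from ruling out a category.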

