我想在Solr中的DataImporthandler中使用多个数据源,并在查询父实体中的数据库后传递子实体中的URL值

Jay*_*yku 4 solr

我想在Solr中的DataImporthandler中使用多个数据源,并在查询父实体中的数据库后传递子实体中的URL值.这是我的rss-data-config文件:

<dataConfig>
    <dataSource type="JdbcDataSource" name="ds-db" driver="com.mysql.jdbc.Driver" url="jdbc:mysql://localhost:3306/HCDACoreDB" 
                            user="root" password="CDA@318"/>
    <dataSource type="URLDataSource" name="ds-url"/>
    <document>
        <entity name="feeds" query="select f.feedurl, f.feedsource, c.categoryname from feeds f, category c where f.feedcategory = c.categoryid">

        <field column="feedurl" name="url" dataSource="ds-db"/>
        <field column="categoryname" name="category" dataSource="ds-db"/>

        <field column="feedsource" name="source" dataSource="ds-db"/>

        <entity name="rss"
                transformer="HTMLStripTransformer" 
                forEach="/RDF/channel | /RDF/item" 
                processor="XPathEntityProcessor" 
                url="${dataimporter.functions.encodeUrl(feeds.feedurl)}" > 

            <field column="source-link" dataSource="ds-url" xpath="/rss/channel/link" commonField="true" />
            <field column="Source-desc" dataSource="ds-url" xpath="/rss/channel/description" commonField="true" />
            <field column="title" dataSource="ds-url" xpath="/rss/channel/item/title" />
            <field column="link" dataSource="ds-url" xpath="/rss/channel/item/link" />
            <field column="description" dataSource="ds-url" xpath="/rss/channel/item/description" stripHTML="true"/>
            <field column="pubDate" dataSource="ds-url" xpath="/rss/channel/item/pubDate" />
            <field column="guid" dataSource="ds-url" xpath="/rss/channel/item/guid" />
            <field column="content" dataSource="ds-url" xpath="/rss/channel/item/content" />
            <field column="author" dataSource="ds-url" xpath="/rss/channel/item/creator" />
        </entity>

    </entity>
</document>
Run Code Online (Sandbox Code Playgroud)

我所做的是在名为feeds的第一个实体中,我正在查询数据库,并希望使用feedurl作为子实体名称rss的URL.

我运行dataimport时得到的错误是:java.net.MalformedURLException:no protocol:nullselect f.feedurl,f.feedsource,c.categoryname from feeds f,category c where f .feedcategory = c.categoryid

URL us NULL意味着它没有将feedur分配给URL.

关于我做错了什么的任何建议?

Tom*_*gli 5

这是一个例子:

<?xml version="1.0" encoding="UTF-8"?>
<dataConfig>
    <dataSource name="db1" ... />
    <dataSource name="db2"... />
    <document>
        <entity name="outer" dataSource="db1" query=" ... ">
            <field column="id" />
            <entity name="inner" dataSource="db2" query=" select from ... where id = ${outer.id} ">
                <field column="innercolumn" splitBy=":::" />
            </entity>
        </entity>
    </document>
Run Code Online (Sandbox Code Playgroud)

我们的想法是嵌套实体的一个定义,对另一个数据库进行额外的查询.

您可以访问像这样的父实体字段$ {outer.id}