solr - DeltaImport fetches all the data -


i'm indexing data database. i'm using delta import fetch updated data. however, find fetching whole data twice , processing once though changes applicable 1 row.

my config.xml deltaquery given:

<dataconfig> <datasource type="jdbcdatasource" driver="com.github.cassandra.jdbc.cassandradriver" url="jdbc:c*://127.0.0.1:9042/test" autocommit="true" rowlimit = '-1' batchsize="-1"/> <document name="content">     <entity name="test" query="select * person" deltaimportquery="select * person seq=${dataimporter.delta.seq}" deltaquery="select seq person last_modified > '${dataimporter.last_index_time}' allow filtering" autocommit="true">         <field column="seq" name="id" />         <field column="last" name="last_s" />         <field column="first" name="first_s" />         <field column="city" name="city_s" />         <field column="zip" name="zip_s" />         <field column="street" name="street_s" />         <field column="age" name="age_s" />         <field column="state" name="state_s" />         <field column="dollar" name="dollar_s" />         <field column="pick" name="pick_s" />     </entity> </document> </dataconfig> 

there 2100000 rows. cause large memory consumption resulting in running out of memory. problem? or work in way only?

if solr running out of memory time add more memory solr box. adding more ram alleviate issue.


Comments

Popular posts from this blog

networking - Vagrant-provisioned VirtualBox VM is not reachable from Ubuntu host -

c# - ASP.NET Core - There is already an object named 'AspNetRoles' in the database -

ruby on rails - ArgumentError: Missing host to link to! Please provide the :host parameter, set default_url_options[:host], or set :only_path to true -