Indexing: enable Dataverse to use Solr in a distributed environment #1083

kcondon · 2014-11-04T21:57:04Z

Dataverse is currently coded to connect to solr on localhost. In order to run in a distributed environment, the app needs to be able to connect to a remote solr instance. Ideally this would be done with solr cloud and a zookeeper ensemble. A list of zookeepers would be stored somewhere which the app would use to initialize a CloudSolrServer object.

- added `SolrHostColonPort` config option - Updated Vagrant to allow spin up of multiple VMs

pdurbin · 2014-11-12T14:36:29Z

In 57dc52c I enabled a new SolrHostColonPort setting and documented it at https://github.com/IQSS/dataverse/blob/master/doc/Sphinx/source/Installers/dataverse-installer-main.rst#solrhostcolonport

Here's an example of how to use it:

curl -X PUT http://localhost:8080/api/s/settings/:SolrHostColonPort/localhost:8983

@kcondon is this good enough for Dataverse 4.0 or is the use of ZooKeeper a hard requirement? Can we defer ZooKeeper to a future release? I'll put this in QA so you can at least test this new setting.

kcondon · 2014-11-13T16:19:00Z

OK, this config param works. Will need to test on multi server env. As for whether it is ok for prod, maybe a discussion for others as well?

kcondon · 2014-11-17T19:38:42Z

Tested on multiple systems and they are both able to update index and see each other's updates. Do not know yet how it performs under load.

Benson still hopes we can use Zookeeper and thinks the config isn't that much more involved.

Closing as basic functionality has been delivered.

pdurbin · 2014-11-17T20:45:32Z

Benson still hopes we can use Zookeeper and thinks the config isn't that much more involved.

Right, he showed me we'd start Solr like this:

java -DzkRun -Dboostrap_confdir=solr/collection1/conf -Dbootstrap_conf=true -jar start.jar

At least that's how developers would start Solr with a single collection/core/thing. In production ZooKeeper would manage multiple.

…rses

kcondon added Type: Feature a feature request UX & UI: Design This issue needs input on the design of the UI and from the product owner Status: Dev labels Nov 4, 2014

kcondon assigned pdurbin Nov 4, 2014

kcondon added this to the Beta 9 - Dataverse 4.0 milestone Nov 4, 2014

pdurbin added a commit that referenced this issue Nov 10, 2014

Solr no longer required to run on localhost #1083

57dc52c

- added `SolrHostColonPort` config option - Updated Vagrant to allow spin up of multiple VMs

pdurbin assigned kcondon and unassigned pdurbin Nov 12, 2014

pdurbin added Status: QA and removed Status: Dev labels Nov 12, 2014

kcondon closed this as completed Nov 17, 2014

kcondon mentioned this issue Dec 3, 2014

Install: Add option for solr server location. #1200

Closed

pdurbin mentioned this issue Jul 7, 2015

Solr: load balancing, fault tolerance, and high availability #2322

Closed

bionary mentioned this issue Mar 2, 2016

2322-solrCloud integration #2985

Closed

11 tasks

plecor added a commit to plecor/dataverse that referenced this issue Dec 11, 2024

IQSS#1083: fix NPE in MyData when all dataverses are harvested datave…

48d0ab7

…rses

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Indexing: enable Dataverse to use Solr in a distributed environment #1083

Indexing: enable Dataverse to use Solr in a distributed environment #1083

kcondon commented Nov 4, 2014

pdurbin commented Nov 12, 2014

kcondon commented Nov 13, 2014

kcondon commented Nov 17, 2014

pdurbin commented Nov 17, 2014

Indexing: enable Dataverse to use Solr in a distributed environment #1083

Indexing: enable Dataverse to use Solr in a distributed environment #1083

Comments

kcondon commented Nov 4, 2014

pdurbin commented Nov 12, 2014

kcondon commented Nov 13, 2014

kcondon commented Nov 17, 2014

pdurbin commented Nov 17, 2014