Refactor code design #18

mihneadb · 2014-11-19T19:52:27Z

Need

The code should be nicely structured in order to make it easy to contribute.

Deliverables

code has a new structure that makes more sense given the ETag vs polling mechanisms introduced in Incremental loading of Github events and issues. #16
clean up debug logs
lay out the overall process for fetching and indexing changes in an abstract class, and let subclasses decide on how to do each step. More or less like template method pattern
do pagination by implementing an iterator
use an executor instead of a thread

Solution

TODO

mihneadb · 2014-11-19T19:52:50Z

cc @LosD feel free to come up with a suggestion and I'll edit the issue with your ideas.

LosD · 2014-11-20T07:56:57Z

I have a few, though some of them may not be worth the effort.

Lay out the overall process for fetching and indexing changes in an abstract class, and let subclasses decide on how to do each step. More or less like template method pattern.
Do pagination by implementing an iterator. Then the complexities of paginating are well hidden, making it easier to work with a new type (i.e. Git data). I don't believe there is anything specific to each entry type, but in that case it should still be possible to make subclasses for specifics.
Use an executor instead of a thread. That should make the River a little more resilient. Right now an unexpected exception will simply stop the thread, even with no exception in the logs. I know, because I screwed up an ES query :)

mihneadb · 2015-01-24T16:49:26Z

@LosD I'm thinking of rewriting the river [as soon as I find some time] in a better manner using Scala. Are you interested in hacking some Scala? :)

LosD · 2015-01-24T16:54:03Z

My Scala experience is rather limited (we use it for a few tests and analysis tools at my employer), but yeah :-)

On 24 January 2015 17:49:27 CET, Mihnea Dobrescu-Balaur [email protected] wrote:

@LosD I'm thinking of rewriting the river [as soon as I find some time]
in a better manner using Scala. Are you interested in hacking some
Scala? :)

Reply to this email directly or view it on GitHub:
#18 (comment)

LosD · 2015-05-07T15:05:40Z

Heads up: I've been at the ElasticSearch Core training today, and they warned me that rivers are deprecated. They will still be there in 2.0, but probably not much longer than that.

Logstash is now the expected way to get data into ES, so we might not want to put too much energy into updating the river. However, I know nothing about Logstash, so I don't know if it would even be viable for importing stuff from Github.

LosD · 2015-05-07T15:15:34Z

More info here https://www.elastic.co/blog/deprecating_rivers

mihneadb · 2015-05-08T13:12:50Z

Thanks for the info! :) Will keep that in mind.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor code design #18

Refactor code design #18

mihneadb commented Nov 19, 2014

mihneadb commented Nov 19, 2014

LosD commented Nov 20, 2014

mihneadb commented Jan 24, 2015

LosD commented Jan 24, 2015

LosD commented May 7, 2015

LosD commented May 7, 2015

mihneadb commented May 8, 2015

Refactor code design #18

Refactor code design #18

Comments

mihneadb commented Nov 19, 2014

Need

Deliverables

Solution

mihneadb commented Nov 19, 2014

LosD commented Nov 20, 2014

mihneadb commented Jan 24, 2015

LosD commented Jan 24, 2015

LosD commented May 7, 2015

LosD commented May 7, 2015

mihneadb commented May 8, 2015