Can I install this software without abusing X10 service?

Status
Not open for further replies.

studio52

New Member
Messages
17
Reaction score
0
Points
1
I've been looking for a robust search engine, and I found this one, it looks nice and will work well with my flat 1.8 GB database of New Mexico case law that need's to be fully searchable to provide our nonprofit "access to justice" program with the case law to support this and our other exempt legal aid programs; the problem is it looks very resource intensive, the search engine software is here: "http://www.atnf.csiro.au/computing/software/arch/" I don't think we can use a mysql DB, unless anyone knows of a way to get 162 years worth of case law (avg. 150 files per year) into a DB or a CMS or wiki without spending the next two years doing data entry?

If you have any better solutions, please post them here!
 

essellar

Community Advocate
Community Support
Messages
3,295
Reaction score
227
Points
63
In a word, no. You'd need a VPS at a minimum or a dedicated/colocated server (or cluster of servers) to make it work. "Resource intensive" doesn't even begin to describe it (although Hadoop, Nutch and Solr are, in their own special way, very efficient in a "making the best of a bad situation" sort of way — "big data" is never going to be trivial). You also need to be able to run long-lasting processes and install binaries, neither of which are going to happen in a shared hosting environment. (Although you can find turn-key hosting that includes Hadoop, Nutch and Solr, so it's not necessarily all up to you — but it's usually at business-class prices, with an entry-level instance cost of around $50/mo.)

From a cost-effectiveness point of view, your best bet is probably to rely on Google or a similar search engine (you can integrate their search into your site in a way that stays within your site). To make that happen well, you'll need to be meticulous about creating and maintaining site maps. The main drawback is that you can't do local hinting; you're pretty much at the mercy of the search engine's "did you mean" algorithms, which (depending on the information field in question) can produce hilarious results from time to time.

I wish I could give you better news than "wow, that's going to cost you a bundle", but I can't. The lowest-cost option is going to be a VPS that you'll need to manage/keep secure, and the lowest-cost VPS option (at $8.95/mo) probably won't have enough CPU/memory grunt to run MapReduce (the core of the search/indexing functions) effectively on your data.
 
Status
Not open for further replies.
Top