For an overview, see our Bixo introduction slides.
Specific areas of documentation on this site are:
There is also a PDF of Ken Krugler’s talk at the Bay Area Hadoop User Group (HUG) September Meetup. The talk describes how to use Bixo to build a web mining application, and includes an example of mining the Hadoop mailing list archives to find helpful Hadoopers. The code for this example is in the project’s GitHub repository under the contrib/helpful sub-directory.