Database and scrapers for media center (XBMC, Boxee, ...)

  • Throughout the month of April 2024, participate in the FileJoker Thread Contest OPEN TO EVERYONE!

    From 1st to 30th of April 2024, members can earn cash rewards by posting Filejoker-Exclusive threads in the Direct-Downloads subforums.

    There are $1000 in prizes, and the top prize is $450!

    For the full rules and how to enter, check out the thread
  • Akiba-Online is sponsored by FileJoker.

    FileJoker is a required filehost for all new posts and content replies in the Direct Downloads subforums.

    Failure to include FileJoker links for Direct Download posts will result in deletion of your posts or worse.

    For more information see
    this thread.

timscampi

New Member
Jul 6, 2008
5
0
I've been adding all AV and IV I download to my media center, but I keep facing an annoying problem: there are no database to scrape content from for those types of movies.
You can find adult scrapers, but they only cover western movies.

I also have a sizable Japanese music collection and, again, I can't scrape most of it and I'm forced to enter information manually for a lot of stuff.

As such, I've been thinking about creating my own database and make scrapers for the software I use (XBMC)

The database would cover AV, IV, hentai movies, Japanese music and maybe other things. (But those are the main things I can't get from any "regular" scrapers.)

I'm not really advanced in PHP (But I can manage with the documentation.) and hosting will be an issue (I am still a student and hosting that allows adult content tend to be quite expensive.) but I started planning everything anyway.

I'm thinking about an XML based API.
Every entry (movie, album, ...) in the database would have a corresponding XML file generated each time a modification is made to the entry's information.
Each XML would have a unique URL that can be found using a php file that returns the URL. The scraper would use that page to get the URL, then get the XML and parse it for infos.
If the query (Name of the movie, album, ...) cannot be found, the script returns a list of partial matches.

As I said, I'm more of a beginner in PHP, even if I have experience in Python, Visual Basic and Pascal (At school and, now, university.)
As such, if you have ideas on how to implement it or how the database and scraper should work, or if you have any suggestion, go ahead :) (That's mainly why I'm posting this.)

Also, what would the interest for something like this ?
I know most people don't watch JAV and those who do probably don't put them on their media center, if they even have one ^^