about statistics
Posted: Mon Jul 08, 2013 1:31 pm
Something I've been thinking about and can't really understand.
The IMDb statistics page shows 2,571,007 titles (a lot, but after all not that many), and the number grows every day (mostly with new movies; the older ones are already there, except of course some foreign-language ones). Our goal is to have the very same database but with added functionality (I assume so because we can only request movies that have IMDb numbers). Am I right so far? But until now we have only managed to "register" about 107,000 movies. When I say "we", I count myself as part of this project too, since I use it.
I don't have programming skills, but I once had a small project where I collected data from different sites and Excel documents, and the guy who helped me told me that this process can be automated. A routine (a macro, whatever) can register the data in my database without my intervention. I only need to review it afterwards and correct the things I don't like ...
A remark on your help page indicates something similar: you recommend not editing a request post, because the "robot" may already have processed it.
Now the BIG question: since you only process movies with IMDb numbers, and all IMDb movies follow the same structure, http://www.imdb.com/title/ttxxxxxxx/, why don't you simply scan the whole of IMDb and offer a full database? Why do we need 213 pages of requests, when we could have only requests for covers, data corrections ... in a much smaller amount?
What am I missing? What haven't I considered above?
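To make the idea concrete, here is a minimal Python sketch of what "scanning" by URL pattern would mean, plus the coverage arithmetic from the numbers above. It is only an illustration: the function names are hypothetical, the zero-padding to seven digits is an assumption based on the ttxxxxxxx pattern, and the code only builds URL strings. It does not fetch anything, and whether every number maps to an existing title is not something I know.

```python
# Illustrative sketch only: names are hypothetical, nothing is downloaded.

IMDB_TITLES = 2_571_007      # figure quoted from the IMDb statistics page
REGISTERED_TITLES = 107_000  # approximate number registered so far


def imdb_title_url(number: int) -> str:
    """Build a title URL, assuming IDs are 'tt' plus a 7-digit zero-padded number."""
    return f"http://www.imdb.com/title/tt{number:07d}/"


def candidate_urls(start: int, stop: int) -> list[str]:
    """Return the URLs for title numbers in the half-open range [start, stop)."""
    return [imdb_title_url(n) for n in range(start, stop)]


def coverage_percent(registered: int, total: int) -> float:
    """Share of the total that is already registered, in percent."""
    return 100.0 * registered / total


if __name__ == "__main__":
    for url in candidate_urls(1, 4):
        print(url)
    print(f"{coverage_percent(REGISTERED_TITLES, IMDB_TITLES):.1f}% registered")
```

So with the figures above, the roughly 107,000 registered movies are only a little over 4% of what IMDb lists.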