We are three graduate students at UC Berkeley working on a machine learning project for a class.
1) We found views with a `size` of -1 or 0. Does this mean the page doesn’t exist?
2) We found that the `size` of some articles varies widely across the hourly snapshots of a single day. Is that legitimate, or is there something odd with the data?