Re: [Wikitech-l] Can we drop revision hashes (rev_sha1)?

15 Sep 2017


      Hi!
On 9/15/17 1:06 PM, Andrew Otto wrote:
...
...
As a random idea - would it be possible to calculate the hashes
when data is transitioned from SQL to Hadoop storage?
We take monthly snapshots of the entire history, so every month we’d
have to pull the content of every revision ever made :o
Why? If you already seen that revision in previous snapshot, you'd
already have its hash? Admittedly, I have no idea how the process works,
so I am just talking out of general knowledge and may miss some things.
Also of course you already have hashes from revs till this day and up to
the day we decide to turn the hash off. Starting that day, it'd have to
be generated, but I see no reason to generate one more than once?
-- 
Stas Malyshev
smalyshev@wikimedia.org

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] Can we drop revision hashes (rev_sha1)?