You might want to fiddle with $wgDBTableOptions.[0] The current
default is different from what you have.
[0]
Below is my LocalSettings.php w/ sensitive info (e.g.,
logins/pw) blanked
out.
Best Regards,
Krishna
<?php
# This file was automatically generated by the MediaWiki installer.
# If you make manual changes, please keep track in case you need to
# recreate them later.
#
# See includes/DefaultSettings.php for all configurable settings
# and their default values, but don't forget to make changes in _this_
# file, not there.
#
# Further documentation for configuration settings may be found at:
#
http://www.mediawiki.org/wiki/Manual:Configuration_settings
# If you customize your file layout, set $IP to the directory that contains
# the other MediaWiki files. It will be used as a base to locate files.
if( defined( 'MW_INSTALL_PATH' ) ) {
$IP = MW_INSTALL_PATH;
} else {
$IP = dirname( __FILE__ );
}
$path = array( $IP, "$IP/includes", "$IP/languages" );
set_include_path( implode( PATH_SEPARATOR, $path ) . PATH_SEPARATOR .
get_include_path() );
require_once( "$IP/includes/DefaultSettings.php" );
require_once( "{$IP}/extensions/NewUserNotif/NewUserNotif.php" );
#add extension to add <ref> tags
require_once( $IP.'/extensions/Cite/Cite.php' );
# If PHP's memory limit is very low, some operations may fail.
# ini_set( 'memory_limit', '20M' );
# make sure php can find mail api
ini_set ('include_path', get_include_path() . PATH_SEPARATOR .
'/usr/share/php/');
if ( $wgCommandLineMode ) {
if ( isset( $_SERVER ) && array_key_exists( 'REQUEST_METHOD',
$_SERVER
) ) {
die( "This script must be run from the command line\n" );
}
}
## Uncomment this to disable output compression
# $wgDisableOutputCompression = true;
#$wgDisableOutputCompression = false;
$wgSitename = "Hindupedia, the Hindu Encyclopedia";
## The URL base path to the directory containing the wiki;
## defaults for all runtime URL paths are based off of this.
## For more information on customizing the URLs please see:
##
http://www.mediawiki.org/wiki/Manual:Short_URL
$wgScriptPath = "/eng"; #virtual path (right part of the first
rewrite rule)
$wgScriptExtension = ".php";
$wgArticlePath = "/en/$1"; #virtual path (left part of first rewrite rule)
$wgUsePathInfo = true;
$wgEnableWriteAPI = true;
$wgEnableEmail = true;
$wgEnableUserEmail = true; # UPO
$wgEmergencyContact = "support(a)hindupedia.com".com";
$wgPasswordSender = "support(a)hindupedia.com".com";
## For a detailed description of the following switches see
##
http://www.mediawiki.org/wiki/Extension:Email_notification
## and
http://www.mediawiki.org/wiki/Extension:Email_notification
## There are many more options for fine tuning available see
## /includes/DefaultSettings.php
## UPO means: this is also a user preference option
$wgEnotifUserTalk = true; # UPO
$wgEnotifWatchlist = true; # UPO
$wgEmailAuthentication = true;
$wgDBtype = "mysql";
$wgDBname = "HindupediaMySQL";
$wgDBserver = "localhost";
$wgDBuser = ""; ##blanked out
$wgDBpassword = ""; ##blanked out
# MySQL specific settings
$wgDBprefix = "";
# MySQL table options to use during installation or update
$wgDBTableOptions = "TYPE=InnoDB";
# Experimental charset support for MySQL 4.1/5.0.
$wgDBmysql5 = false;
# Postgres specific settings
$wgDBport = "5432";
$wgDBmwschema = "mediawiki";
$wgDBts2schema = "public";
## Shared memory settings
$wgMainCacheType = CACHE_ACCEL; /* default: CACHE_NONE */
#$wgMemCachedServers = array();
## To enable image uploads, make sure the 'images' directory
## is writable, then set this to true:
$wgEnableUploads = true;
$wgUseImageMagick = true;
$wgImageMagickConvertCommand = "/usr/bin/convert";
## If you use ImageMagick (or any other shell command) on a
## Linux server, this will need to be set to the name of an
## available UTF-8 locale
$wgShellLocale = "en_US.UTF-8";
$wgUploadPath = "$wgScriptPath/images";
$wgUploadDirectory = "$IP/images";
#make links case in-sensitive
#$wgCapitalLinks = false;
$wgMimeDetectorCommand = "file -bi";
#mime type detection for pptx is incorrect...hence, disable mime type
checking for uploading it
#$wgVerifyMimeType = false;
srand(time());
$random = (rand()%4);
if ($random == 1)
{
$wgLogo = "$wgScriptPath/images/hindupediabanner400x73.gif";
}
else if ($random == 2)
{
$wgLogo = "$wgScriptPath/images/hindupediabanner2_400x73.gif";
}
else if ($random == 3)
{
$wgLogo = "$wgScriptPath/images/hp_banner_shreyans-1.gif";
}
else
{
$wgLogo = "$wgScriptPath/images/hp_banner_shreyans-2.gif";
}
## If you want to use image uploads under safe mode,
## create the directories images/archive, images/thumb and
## images/temp, and make them all writable. Then uncomment
## this, if it's not already uncommented:
# $wgHashedUploadDirectory = false;
## If you have the appropriate support software installed
## you can enable inline LaTeX equations:
$wgUseTeX = false;
$wgLocalInterwiki = $wgSitename;
$wgLanguageCode = "en";
$wgSecretKey = ""; ##blanked out
## Default skin: you can change the default skin. Use the internal symbolic
## names, ie 'standard', 'nostalgia', 'cologneblue',
'monobook':
#wfLoadSkin( 'CologneBlue' );
#wfLoadSkin( 'Modern' );
#wfLoadSkin( 'MonoBook' );
#wfLoadSkin( 'Vector' );
$wgDefaultSkin = 'monobook';
#$wgDefaultSkin = "gumax";
## For attaching licensing metadata to pages, and displaying an
## appropriate copyright notice / icon. GNU Free Documentation
## License and Creative Commons licenses are supported so far.
$wgEnableCreativeCommonsRdf = true;
$wgRightsPage = ""; # Set to the title of a wiki page that describes your
license/copyright
$wgRightsUrl = "http://www.gnu.org/copyleft/fdl.html";
$wgRightsText = "GNU Free Documentation License 1.2";
$wgRightsIcon = "${wgScriptPath}/skins/common/images/gnu-fdl.png";
# $wgRightsCode = "gfdl"; # Not yet used
$wgDiff = "/usr/bin/diff";
$wgDiff3 = "/usr/bin/diff3";
# When you make changes to this configuration file, this will make
# sure that cached pages are cleared.
$wgCacheEpoch = max( $wgCacheEpoch, gmdate( 'YmdHis', @filemtime( __FILE__
) ) );
# user/group permissions
$wgGroupPermissions['*']['edit'] = false;
$wgGroupPermissions['user']['edit'] = true;
# $wgGroupPermissions['confirmed']['edit'] = true;
#disable anonymous talk
$wgDisableAnonTalk = true;
#enable support for SVG image file type
$wgFileExtensions[] = 'svg';
$wgAllowTitlesInSVG = true;
#$wgSVGConverter = '/usr/share/ImageMagick';
$wgSVGConverters = array(
'ImageMagick' => '$path/convert -background white -thumbnail
$widthx$height\! $input PNG:$output',
'sodipodi' => '$path/sodipodi -z -w $width -f $input -e $output',
'inkscape' => '$path/inkscape -z -w $width -f $input -e $output',
'batik' => 'java -Djava.awt.headless=true -jar
$path/batik-rasterizer.jar -w $width -d $output $input',
'rsvg' => '/usr/bin/rsvg -w$width -h$height $input $output',
'imgserv' => '$path/imgserv-wrapper -i svg -o png -w$width $input
$output',
);
$wgSVGConverter = 'rsvg' ;
#enable support for MS PowerPoint
$wgFileExtensions[] = 'pptx';
$wgFileExtensions[] = 'ppt';
#enable support for AAC audio files
$wgFileExtensions[] = 'm4a';
#### Google Analytics
##Migration edit
require_once( "$IP/extensions/googleAnalytics/googleAnalytics.php" );
# Replace xxxxxxx-x with YOUR GoogleAnalytics UA number
$wgGoogleAnalyticsAccount = ""; ##blanked out
$wgGoogleAnalyticsIgnoreSysops = true;
$wgGoogleAnalyticsIgnoreBots = true;
# misc mediawiki settings
$wgEnableMWSuggest = true;
$wgShowIPinHeader = false;
# Set address bar icon
$wgFavicon = "/favicon.ico";
################### ConfirmAccount
#confirm user extension settings
###KKM update incompatible###
#include_once('extensions/ConfirmAccount/SpecialConfirmAccount.php');
$wgMakeUserPageFromBio = true;
$wgUseRealNamesOnly = true;
$wgAccountRequestMinWords = 50;
$wgAccountRequestExtraInfo = true;
$wgAllowAccountRequestFiles = true;
$wgConfirmAccountSaveInfo = true;
$wgConfirmAccountContact = "HindupediaSysop(a)hindupedia.com".com";
#StubManager Extension
require_once( "{$IP}/extensions/StubManager/StubManager.php" );
#Header Footer extension
require_once( "{$IP}/extensions/HeaderFooter/HeaderFooter.php" );
#Random Pages Extension
$namespace=true; #restrict extension to global namespace
###KKM update incompatible###
#require_once( "{$IP}/extensions/RandomPages/RandomPages.php" );
#Category Tree Extension
$wgUseAjax = true;
##Migration edit
#require_once( "{$IP}/extensions/CategoryTree/CategoryTree.php" );
#Description plugin - adds metadata from intro paragraph of each article to
help with search engine indexing
require_once("$IP/extensions/Description/Description.php");
#Add poem function for sanskrit text
require_once($IP.'/extensions/Poem/Poem.php');
#Add AutoLink
###KKM update incompatible###
#require_once("$IP/extensions/Autolink/Autolink.php");
#Add Social Bookmarking link
require_once("$IP/extensions/Bookmark/bookmark.php");
#Add Google Search
require_once("extensions/GoogleSiteSearch/GoogleSiteSearch.php");
#Add collist
require_once("extensions/collist/collist.php");
#add imagemap
#require_once("$IP/extensions/ImageMap/ImageMap.php");
#add dynamic page list
#KKM update fail
#require_once("$IP/extensions/DynamicPageList/DynamicPageList2.php");
#add parser functiosn (required for most templates)
##migration edit
require_once("$IP/extensions/ParserFunctions/ParserFunctions.php");
#add article to category
#require_once("$IP/extensions/articletocategory/articletocategory.php");
#add page black list extension
require_once("$IP/extensions/blacklist/blacklist.php");
$wgWhitelist['user']['read'] = $wgBlacklist['*']['read']
=
array("Special:RecentChanges", "Special:Listusers",
"Special:Ipblocklist",
"Special:Log", "Special:Allmessages", "User talk:Krishna
Maheshwari");
#add embed video extension (enable adding videos from popular sites like
google video, you tube, etc
#KKM update fail###
#require_once("$IP/extensions/EmbedVideo/EmbedVideo.php");
#if ($wgUser->isLoggedIn())
#{
#CharInsert -- allow clickable extra characters & tags for edit boxes
require_once("$IP/extensions/CharInsert/CharInsert.php");
#}
#rss feeds
#works but doesn't transclude source
#require_once("$IP/extensions/rss/rss.php");
#rss feeds - removed to see if it improves performance
#require_once("$IP/extensions/feedimport/feedimport.php");
#works but provides links to source instead of transcluding it
#require_once("$IP/extensions/simplefeed/SimpleFeed.php");
#Add Minify--which strips extra stuff from css & js files to make them
smaller
require_once("$IP/extensions/Minify/Minify.php");
#use ETags to facilitate caching at intermediary layers & the browser
$wgUseETag=true; /* default: false */
#server to send pages in to the browser in a compressed format
#$wgUseGzip=true; /* default: false */
#cache sidebar ...
#$wgSidebarCacheExpiry = 86400s; /*default*/
$wgEnableSidebarCache = true; /* default: false */
#enable client side caching
$wgCachePages = true; /* default: true */
#ParserCache
$wgEnableParserCache = true; /* default:false*/
$wgMainCacheType = CACHE_ANYTHING; /* default: CACHE_ANYTHING */
# Enable the basic file cache for static pages for non-logged-in visitors
$wgUseFileCache = true; /* default: false */
$wgFileCacheDirectory = "$IP/cache";
#default is 1, changing to a higher number will be a little bit
# nicer to the database
$wgHitcounterUpdateFreq = 100;
$wgExtraNamespaces[100] = "Quotes";
$wgExtraNamespaces[101] = "Quotes_talk";
$wgExtraNamespaces[102] = "Internal";
$wgExtraNamespaces[103] = "Internal_talk";
$wgExtraNamespaces[104] = "SupportUs";
$wgExtraNamespaces[105] = "SupportUs_talk";
#CSO_SiteAd - allow javascript inside protected wiki pages (ie
reciprocalURL)
#require_once("$IP/extensions/CSO_SiteAd/CSO_SiteAd.php");
####################### System Administration Extensions
#######################
##################### Enable only when needed, then disable
####################
#User Merge & Delete Extension
#KKM update fail###
#require_once( "$IP/extensions/UserMerge/UserMerge.php" );
$wgGroupPermissions['bureaucrat']['usermerge'] = true;
#optional - default is array( 'sysop' )
#$wgUserMergeProtectedGroups = array( 'groupname' );
#Usage Statistics
#KKM update fail###
#require_once( "{$IP}/extensions/UsageStatistics/SpecialUserStats.php" );
$wgUserStatsGoogleCharts = 1;
#SpecialLastUserLoginEx
#KKM update fail###
#require_once( "$IP/extensions/UserStats/SpecialLastUserLogin.php" );
$wgGroupPermissions['sysop']['lastlogin'] = true;
################################################################################
####################### Sending Email #######################
################################################################################
$wgSMTP = array(
'host' => 'ssl://smtp.gmail.com',
'IDHost' => 'hindupedia.com',
'port' => 465,
'username' => '', ##blanked out
'password' => '', ##blanked out
'auth' => true
);
#$wgReadOnly = 'This wiki is currently being upgraded to a newer software
version.';
#require_once( "$IP/extensions/JsonConfig/JsonConfig.php" );
require_once( "$IP/extensions/PayPal.php" );
require_once( "$IP/extensions/MiniDonation/MiniDonation.php" );
require_once( "$IP/extensions/IAST/IAST.php" );
$wgShowExceptionDetails = true;
$wgShowSQLErrors = true;
$wgDebugDumpSql = true;
$wgShowDBErrorBacktrace = true;
#Shows PHP errors
#error_reporting( -1 );
#ini_set( 'display_errors', 1 );
$wgReadOnly = 'This wiki is currently being upgraded to a newer software
version.';
$wgUpgradeKey = ''; ##blanked out
--------------------------------------------------------------------------------
Krishna Maheshwari
kmaheshwari(a)mba2007.hbs.edu
kkm9(a)cornell.edu
Hindupedia, the Hindu Encyclopedia (
www.hindupedia.com)
--------------------------------------------------------------------------------
On Mon, Nov 16, 2015 at 10:48 PM, kkm <kkm5848(a)gmail.com> wrote:
Continuing to dig into this...
Below are page table descriptions from mysql:
-----PAGE TABLE STATUS BEFORE UPGRADE-----
mysql> SHOW TABLE STATUS WHERE `NAME` LIKE '%page%';
+-------------------+--------+---------+------------+-------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+---------+
| Name | Engine | Version | Row_format | Rows |
Avg_row_length | Data_length | Max_data_length | Index_length | Data_free |
Auto_increment | Create_time | Update_time | Check_time |
Collation | Checksum | Create_options | Comment |
+-------------------+--------+---------+------------+-------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+---------+
| page | InnoDB | 10 | Compact | 12461 |
127 | 1589248 | 0 | 2162688 | 11534336 |
14232 | 2015-11-01 10:40:06 | NULL | NULL | utf8_general_ci
| NULL | | |
| page_props | InnoDB | 10 | Compact | 0 |
0 | 16384 | 0 | 0 | 11534336 |
NULL | 2015-11-01 10:40:07 | NULL | NULL | utf8_general_ci
| NULL | | |
| page_restrictions | InnoDB | 10 | Compact | 2 |
8192 | 16384 | 0 | 65536 | 11534336
| 3 | 2015-11-01 10:40:07 | NULL | NULL |
utf8_general_ci | NULL | | |
| pagelinks | InnoDB | 10 | Compact | 28377 |
56 | 1589248 | 0 | 1589248 | 11534336 |
NULL | 2015-11-01 10:40:07 | NULL | NULL | utf8_general_ci
| NULL | | |
+-------------------+--------+---------+------------+-------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+---------+
mysql> SHOW COLUMNS FROM page;
+-------------------+---------------------+------+-----+----------------+----------------+
| Field | Type | Null | Key | Default |
Extra |
+-------------------+---------------------+------+-----+----------------+----------------+
| page_id | int(10) unsigned | NO | PRI | NULL |
auto_increment |
| page_namespace | int(11) | NO | MUL | NULL
| |
| page_title | varchar(255) | NO | | NULL
| |
| page_restrictions | tinyblob | NO | | NULL
| |
| page_counter | bigint(20) unsigned | NO | | 0
| |
| page_is_redirect | tinyint(3) unsigned | NO | | 0
| |
| page_is_new | tinyint(3) unsigned | NO | | 0
| |
| page_random | double unsigned | NO | MUL | NULL
| |
| page_touched | binary(14) | NO | |
| |
| page_latest | int(10) unsigned | NO | | NULL
| |
| page_len | int(10) unsigned | NO | MUL | NULL
| |
+-------------------+---------------------+------+-----+----------------+----------------+
mysql> select page_id,page_title from page where page_title like 'Adhik%';
+---------+----------------------------+
| page_id | page_title |
+---------+----------------------------+
| 11974 | Adhikamasa |
| 11975 | Adhikamasa,_adhimasa |
| 7128 | AdhikamÄ sa |
| 6658 | AdhikamÄ sa,_adhimÄ sa |
| 11977 | Adhikara |
| 11978 | Adhikara_Nandi |
| 11976 | Adhikarana |
| 6660 | Adhikaraṇa |
| 6659 | AdhikÄ ra |
| 6661 | AdhikÄ ra_Nandi |
| 7126 | AdhikamÄ sa,_adhimÄ sa |
| 7117 | Adhikaraṇa |
| 7125 | AdhikÄ ra |
| 7118 | AdhikÄ ra_Nandi |
+---------+----------------------------+
-----PAGE TABLE STATUS AFTER UPGRADE-----
+-------------------+--------+---------+------------+-------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+---------+
| Name | Engine | Version | Row_format | Rows |
Avg_row_length | Data_length | Max_data_length | Index_length | Data_free |
Auto_increment | Create_time | Update_time | Check_time |
Collation | Checksum | Create_options | Comment |
+-------------------+--------+---------+------------+-------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+---------+
| page | InnoDB | 10 | Compact | 11883 |
133 | 1589248 | 0 | 2162688 | 111149056 |
14232 | 2015-11-01 10:40:06 | NULL | NULL | utf8_general_ci
| NULL | | |
| page_props | InnoDB | 10 | Compact | 0 |
0 | 16384 | 0 | 0 | 111149056 |
NULL | 2015-11-01 10:40:07 | NULL | NULL | utf8_general_ci
| NULL | | |
| page_restrictions | InnoDB | 10 | Compact | 2 |
8192 | 16384 | 0 | 65536 | 111149056
| 3 | 2015-11-01 10:40:07 | NULL | NULL |
utf8_general_ci | NULL | | |
| pagelinks | InnoDB | 10 | Compact | 28765 |
55 | 1589248 | 0 | 1589248 | 111149056 |
NULL | 2015-11-01 10:40:07 | NULL | NULL | utf8_general_ci
| NULL | | |
+-------------------+--------+---------+------------+-------+----------------+-------------+-----------------+--------------+-----------+----------------+---------------------+-------------+------------+-----------------+----------+----------------+---------+
mysql> select page_id,page_title from page where page_title like 'Adhik%';
+---------+----------------------------+
| page_id | page_title |
+---------+----------------------------+
| 11974 | Adhikamasa |
| 11975 | Adhikamasa,_adhimasa |
| 7128 | AdhikamÄ sa |
| 6658 | AdhikamÄ sa,_adhimÄ sa |
| 11977 | Adhikara |
| 11978 | Adhikara_Nandi |
| 11976 | Adhikarana |
| 6660 | Adhikaraṇa |
| 6659 | AdhikÄ ra |
| 6661 | AdhikÄ ra_Nandi |
| 7126 | AdhikamÄ sa,_adhimÄ sa |
| 7117 | Adhikaraṇa |
| 7125 | AdhikÄ ra |
| 7118 | AdhikÄ ra_Nandi |
+---------+----------------------------+
Not sure if this is correct, but I came across
http://product.hubspot.com/bid/7049/MySQL-and-Unicode-Three-Gotchas
which suggested that the page_title field could be improperly encoded (as
varchar 255 instead of utf8).
I tried to do
ALTER TABLE page MODIFY page_title VARCHAR(255) CHARACTER SET utf8;
But there are way too many conflicts/duplicates in the key page_name which
prevents me from doing it w/o dropping the index. BTW, if I do it, I can't
recreate it with the "UNIQUE" property which it has right now due to the
duplicates.
https://www.mediawiki.org/wiki/Manual:MWDumper
Suggests that the character type should be set to Binary to prevent these
types of issues...but not sure what I need to do to get there from where I
am right now.
Any ideas?
Thanks,
Krishna
--------------------------------------------------------------------------------
Krishna Maheshwari
Hindupedia, the Hindu Encyclopedia (
www.hindupedia.com)
--------------------------------------------------------------------------------
_______________________________________________
MediaWiki-l mailing list
To unsubscribe, go to:
https://lists.wikimedia.org/mailman/listinfo/mediawiki-l