Jump to content

User:Josh Parris/Text heavy dab pages

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by BD2412 (talk | contribs) at 04:55, 20 April 2010 (one done). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

These pages turned up in Category:All disambiguation pages

page_title bytes_per_link
Chandrabhaga 2724.0000
Franklin_Bridge 2697.8182
Infoshop_(disambiguation) 2618.0000
MISA 2276.0000
Enchantica_(band) 1645.3333
Cryan 840.3333
Odd_Fellows 806.7778
Malakand 764.8182
Four_Heavenly_Kings_(disambiguation) 741.7500
Muss 714.0000
Brahmi 693.2500
EDX 643.0000
Parangdo 624.6667
Polans 597.8000
Assumption_University 584.0000
Richland,_Kansas 582.1667
Red_Canyon 577.4000
Mollywood 577.3333
ETourism 575.0000
Aintiram 546.5000
The_Shakers 513.3333
Star_of_the_Sea_Church 513.3000 (seems fine)
Eziama 512.8750
Retrogames 500.6000
Collin 500.3750
LOBSTER 495.3333
SPIF 490.2857
Genographics_(disambiguation) 482.5714
UTSS 482.5000
Design_Science 481.2857
Overage 479.8000
Bascom 472.3200
Church_of_the_Firstborn 471.7500
Whole_number 469.8000
Bravery_Medal 469.7500
Lydon 451.0000
Retail_Price_Index 447.6667
Golden_Swallow 444.6667
Calvary_Christian_School 432.2500
Bell,_Oklahoma_(disambiguation) 429.5000
Neo-experimentalism 409.0000
Sri_Kumaran_Children's_Home,_Bangalore 402.5000
Angiras 396.4000
Nassi 394.5000
Modern_American_Usage 393.8000
Acid_attack_(disambiguation) 393.1667
Posol 392.5000
Federal_Reserve_Bank_Building 392.0000
Tadiran 391.5625
$5000 388.3333
Dubai_Duty_Free 387.0000
Treatment_effect 384.3333
Red_Rock_Canyon 381.5000
CCISD 374.0000
Limbu_Clans_and_Tribes 368.4848
DLB 363.6667
CADS 350.0000
EALA 345.2857
Theudas 343.5000
Raiding 342.3333
Sumpak 339.3333
Rocky_Lake_(Nova_Scotia) 337.2727
Curtiss_Hawk 332.5000
McGhee 330.0833
Beta_function 329.9200
Airtourer 329.3750
Posola 327.2500
Mud_Lake_(Wisconsin) 327.0612
$10000 326.2857
University_of_Louisiana 325.7143
The_Weblog_Awards 323.2500
Penex_(disambiguation) 321.0000
NVM 320.0000
Jan_Tęczyński 319.6667
Baghmara,_Bangladesh 316.6667
Dimethylhydrazine 316.3333
Archuleta 315.7143
Tzedek 315.0000
Hedda_Gabler_(film) 308.0000
Bonarda 305.2500
Kailash 304.5000
Moon_Records 302.5000
Bezanson 301.5000
Summit_Lake_(California) 300.7500
MicroTAS 297.5000
That's an awesome metric. I don't know what I didn't think of that. Should be a very useful list. I'm just thinking now of more similar metrics... something like sentences to bulleted entries... or something similar. Shadowjams (talk) 09:22, 23 January 2010 (UTC)
By the way, do you want people to remove, or strike entries as they're fixed? Shadowjams (talk) 09:24, 23 January 2010 (UTC)
I guess if they're fixed there's not much point in leaving them in. Josh Parris 10:06, 23 January 2010 (UTC)
  • I've started striking thru after cleaning, or if i notice one already done; IMO this provides a progress metric.
    --Jerzyt 06:07, 17 February 2010 (UTC)

Next time I run this, I ought to exclude/disclose the number of links. NVM is a dab page with one (or maybe two) entries. Josh Parris 10:13, 23 January 2010 (UTC)

Not sure how much granularity you have, but separating pages with < 2 links would be relevant (although probably mostly deletion or redirects). Perhaps a metric about blue to red links would be interesting. These are just ideas though. The better work is spent on dealing with the very good list you put together. Shadowjams (talk) 10:15, 27 January 2010 (UTC)
  • Don't exclude for small# of lks, tho it may (i haven't done any numbers) be worth estimating the typical overhead common to any Dab due to headings and footers and using a linear criterion rather than straight proportion, to get a somewhat lower ratio of false hits. And i presume you're aware of the short-page monitor/dummy-comment scheme that keeps minimal Dabs from clogging the list of short pages.
    --Jerzyt 06:07, 17 February 2010 (UTC)
    • There may be some value also in indicating how much text there is inside those links, as opposed to outside them. Some pages have excessive text because the links themselves are very long (especially where they are links to long-titled sections of long-titled pages). bd2412 T 02:00, 21 February 2010 (UTC)