main | non-english | preprints | 2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | 1998 | 1997 | 1996 | 1995 | 1994 | 1993 | 1992 | 1991 | 1990 | 1980s | 1970s | 1960s | 1950s | 1940s | 1930s | 1920s | 1910s | 1900s | 1890s |
2002

  1. LA Adamic, BA Huberman (2002), "Zipf's law and the internet", Glottometrics, 3:143-150.

  2. Gabriel Altmann (2002), "Zipfian linguistics", Glottometrics, 3:19-26.

  3. VK Balasubrahmanyan, S Naranan (2002), "Algorithmic information, complexity and Zipf´s law", Glottometrics, 4:1-26.

  4. Marcia J Bates (2002), "Speculations on browsing, directed searching, and linking in relation to the Bradford distribution", in Proceedings of the Fourth International Conference on Conceptions of Library and Information Science (COLIS 4), eds. Harry Bruce, Raya Fidel, Peter Ingwersen, Pertti Vakkari, pp.137-150 (Libraries Unlimited).

  5. Karl-Heinz Best (2002), "The distribution of rhythnic units in German short prose", Glottometrics, 3:136-142.

  6. RC Craft, C Leake (2002), "The Pareto principle in organizational decision making", Management Decision, 40(8):729-733.
    [Pareto's principle]

  7. DR Davis, DE Weinstein (2002), "Bones, bombs, and break points: the geography of economic activity", American Economic Review, 92:1269-1289.

  8. Lukasz Debowski (2002), "Zipf's law against the text size: a half-rational model", Glottometrics, 4:49-60.

  9. E Dellandrea, P Makris, N Vincent, M Boiron (2002), "A medical acoustic signal analysis method based on Zipf law", Proceedings of the 14th International Conference on Digital Signal Processing (DSP'2002), pp.615-618 (IEEE Press).

  10. G Caldarelli, A Capocci, P De Los Rios, MA Muñoz (2002), "Scale-free networks from varying vertex intrinsic fitness", Physical Review Letters, 89:258702.

  11. Holger Ebel, Lutz-Ingo Mielsch, Stefan Bornholdt (2002), "Scale-free topology of e-mail networks", Physical Review E, 66:035103.
    [ abstract]

  12. Gertraud Fenk-Oczlon, August Fenk (2002), "Zipf´s tool analogy and word order", Glottometrics, 5:22-28.

  13. Ramon Ferrer i Cancho, Ricard V Sole (2002), "Zipf's law and random texts", Advances in Complex Systems, 5(1):1-6.

  14. James C French (2002), "Modeling web data", Proceedings of the 2nd ACM/IEEE-CS Conference on Digital Libraries pp. 320-321 (ACM Press). [Abstract: We have created three testbeds of web data for use in controlled experiments in collection modeling. This short paper examines the applicability of Ziff's and Heaps' laws as applied to web data. We find extremely close agreement between observed vocabulary growth and Heaps' law. We find reasonable agreement with Zifp's law for medium to low frequency terms. Zifp's law is a poor predictor for high frequency terms. These findings hold for all three testbeds although we restrict ourselves to one here due to space limitations. ]

  15. Georg A. Gottwald, Matthew Nicol (2002), "On the nature of Benford's Law", Physica A, 303(3-4):387-396.
    [ abstract]
    [Benford's law]

  16. TR Gulden (2002), "Spatial and temporal patterns in civil violence: Guatemala, 1977-1986", Politics and the Life Science, 21(1):26-36.

  17. Le Quan Ha, EI Sicilia-Garcia, Ji Ming, FJ Smith (2002), "Extension of Zipf's law to words and phrases", Proceedings of 19th International Conference on Computational Linguistics (COLING'2002), pp.315-320.
    [PDF]

  18. P Harremoes, F Topsoe (2002), "Zipf's law, hyperbolic distributions and entropy loss", Proceedings of IEEE International Symposium on Information Theory, pp. 207-? (IEEE Press).

  19. Wolfgang Hilberg (2002), "The unexpected fundamental influence of mathematics upon language", Glottometrics, 5:29-50.

  20. DC Hoyle, M Rattray, R Jupp, A Brass (2002), "Making sense of microarray data distributions", Bioinformatics, 18(4):576-584.
    [Abstract: MOTIVATION: Typical analysis of microarray data has focused on spot by spot comparisons within a single organism. Less analysis has been done on the comparison of the entire distribution of spot intensities between experiments and between organisms. RESULTS: Here we show that mRNA transcription data from a wide range of organisms and measured with a range of experimental platforms show close agreement with Benford's law (Benford, PROC: Am. Phil. Soc., 78, 551-572, 1938) and Zipf's law (Zipf, The Psycho-biology of Language: an Introduction to Dynamic Philology, 1936 and Human Behaviour and the Principle of Least Effort, 1949). The distribution of the bulk of microarray spot intensities is well approximated by a log-normal with the tail of the distribution being closer to power law. The variance, sigma(2), of log spot intensity shows a positive correlation with genome size (in terms of number of genes) and is therefore relatively fixed within some range for a given organism. The measured value of sigma(2) can be significantly smaller than the expected value if the mRNA is extracted from a sample of mixed cell types. Our research demonstrates that useful biological findings may result from analyzing microarray data at the level of entire intensity distributions. ]
    [Benford's law]

  21. Ludek Hrebicek (2002), "Zipf's law and text", Glottometrics, 3:27-38.

  22. Guohua Jiang, Shi Shan, Lan Jiang, Xuesong Xu (2002), "A new rank-size distribution of Zipf"s Law and its applications", Scientometrics, 54(1):119-130.
    [Abstract: Developing the probability function to describe rank-size Zipfian phenomena, i.e., a form like P(R=r)= c/ra (a>0) with a rank type random variable R, has been an important problem in scientometrics and informetrics, In this article a newrank-size distribution of Zipf's law is presented and applied to an actual distribution of scientific productivities in Chinese universities. ]

  23. Reinhard Kohler (2002), "Power law models in linguistics: Hungarian", Glottometrics, 5:51-61.
    [Abstract: First, the status of Zipf(-Menzerath)'s Law and its criticisms are discussed, and the application of power law models, particularly in linguistics, is supported from a general point of view. The following sections, empirical studies on dependencies are conducted which test the Zipf-Mandelbrot Law, other power law models (Menzerath-Altmann's Law, the length-frequency dependency), and the word length distribution on data from Hungarian (a text and a dictionary). ]

  24. Andras Kornai (2002), "How many words are there", Glottometrics, 4:61-86.

  25. Victor Kromer (2002), "Zipf´s law and its modification possibilities", Glottometrics, 5:1-13.
    [Abstract: In this paper we consider the possibilities of known Zipf-Mandelbrot canonical law modifications. The proposed modifications explain the behavior of the right tail of the distribution and the presence of a deflection in the central part of the distribution (a crater). It is shown that the average word information load is invariant to the sample heterogeneity and that the proposed usage measure "places" the words more correctly with regard to their "importance". ]

  26. Vladmir A Kuznetsov (2002) "Statistics of the numbers of transcripts and protein sequences encoded in the genome", in Computational and Statistical Approaches to Genomics, pp.?-? (Kluwer).
    [ PDF]

  27. Mark Levene, Trevor Genner, George Loizou, Richard Wheeldon (2002), "A stochastic model for the evolution of the web", Computer Networks and ISDN Systems, 39:277-287.
    [PDF]

  28. Wentian Li (2002), "Zipf's law everywhere", Glottometrics, 5:14-21.
    [Abstract: At the 100th anniversary of the birth of George Kingsley Zipf, one striking fact about the statistical regularity that bears his name, Zipf's law, is that it seems to appear everywhere. We may ask these questions related to the ubiquity of Zipf's law: Is there a rigorous test in fitting real data to Zipf's law? In how many forms does Zipf's law appear? In which fields are the data sets claiming to exhibit Zipf's law? ]
    [PDF]

  29. Wentian Li, Yaning Yang (2002), "Zipf's law in importance of genes for cancer classification using microarray data", Journal of Theoretical Biology, 219:539-551.

  30. NM Luscombe, J Qian, Z Zhang, T Johnson, M Gerstein (2002), "The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties", Genome Biology, 3:research0040.

  31. LC Malacarne, RS Mendes, EK Lenzi (2002), "q-exponential distribution in urban agglomeration", Physical Review E, 65(1):017106.
    [Abstract: Usually, the studies of distributions of city populations have been reduced to power laws. In such analyses, a common practice is to consider cities with more than one hundred thousand inhabitants. Here, we argue that the distribution of cities for all ranges of populations can be well described by using a q-exponential distribution. This function, which reproduces the Zipf-Mandelbrot law, is related to the generalized nonextensive statistical mechanics and satisfies an anomalous decay equation.]

  32. B McCowan, LR Doyle, SF Hanser (2002), "Using information theory to assess the diversity, complexity and development of communicative repertoires", Journal of Comparative Psychology, 116:166-172.

  33. Peter Meyer (2002), "Laws and theories in quantitative linguistics", Glottometrics, 5:62-80.
    [Abstract: According to a widespread conception, quantitative linguistics will eventually be able to explain empirical quantitative findings (such as Zipf's Law) by deriving them from highly general stochastic linguistic 'laws' that are assumed to be part of a general theory of human language (cf. Best (1999) for a summary of possible theoretical positions). Due to their formal proximity to methods used in the so-called exact sciences, theoretical explanations of this kind are assumed to be superior to the supposedly descriptive-only approaches of linguistic structuralism and its successors. In this paper I shall try to argue that on close inspection such claims turn out to be highly problematic, both on linguistic and on science-theoretical grounds. ]

  34. Marcelo A Montemurro, D Zanette (2002), "Frequency-rank distribution of words in large text samples: phenomenology and mode", Glottometrics, 4:87-98.

  35. Michael Nelson, J Stephen Downie (2002), "Informetric analysis of a music database", Scientometrics, 54(2):243-255.

  36. Geza Nemeth, Csaba Zainko (2002), "Multilingual statistical text analysis, Zipf''s law and Hungarian speech generation", Acta Linguistica Hungarica, 49(3-4):385-401.
    [PDF]

  37. David M Pennock, Gary W Flake, Steve Lawrence, Eric J Glover, C Lee Giles (2002), "Winners don't take all: Characterizing the competition for links on the web", Proceedings of National Academy of Sciences, 99(8):5207-5211.

  38. Claudia Prun and Robert Zipf (2002), "Biographical notes on G.K. Zipf", Glottometrics, 3:1-10.
    [Abstract: Harvard philologist George Kingsley Zipf has been underestimated by mainstream linguistics for the past half century. After short consideration of the significance of Zipf's work, this paper presents a personal account of Zipf's family background, carreer and private life by one of his sons. An extensive Bibliography is added. ]

  39. William J Reed (2002), "On the rank-size distribution for human settlements", Journal of Regional Science, 41:1-17.
    [ PDF ]

  40. WJ Reed, BD Hughes (2002), "On the size distribution of live genera", Journal of Theoretical Biology, 217(1):125-135.

  41. WJ Reed, BD Hughes (2002), "From gene families and genera to incomes and internet file sizes: why power laws are so common in nature", Physical Review E, 66:067103.

  42. Jeff Robbins (2002), "Technology, ease, and entropy: a testimonial to Zipf´s Principle of Least Effort", Glottometrics, 5:81-96.

  43. Thorsten Roelcke (2002), "Efficiency of communication. A new concept of language economy", Glottometrics, 4:27-38.
    [Abstract: George Kingsley Zipf is known not only as the "father" of language statistics or quantitative linguistics in general, but also as one of the first who discussed the phenomenon of linguistic economy in detail. The following discussion in linguistics and communication sciences shows a wide spread of more or less scientific grounded concepts. This great conceptual diversity however disturbs the scientific discussion further on. Hence in the following contribution a new concept of language economy that fulfils holistic (and atomistic) requirements will be shown. ]

  44. Ronald Rousseau (2002), "Lack of standardisation in informetric research. Comments on 'Power laws of research output. Evidence for journals of economics' by Matthias Sutter and Martin G. Kocher", Scientometrics, 55(2):317-327.

  45. Ronald Rousseau (2002), "George Kingsley Zipf: life, ideas, his law and informetrics", Glottometrics, 3:11-18.
    [Abstract: In this article we present a short biography of the linguist George Kingsley Zipf. We recall his work on the frequency of words in Chinese language and briefly discuss Zipf's principle of least effort. We mention his influence in the field of informetrics and end this contribution by highlighting some recent applications of Zipf's law in Internet research, geography and economics. ]

  46. Petr Savický and Jaroslava Hlavácová (2002), "Measures of word commonness", Journal of Quantitative Linguistics, 9(3):215-231
    [ abstract]

  47. S Solomon, P Richmond (2002), "Stable power laws in variable economies: Lotka-Volterra implies Pareto-Zipf", European Physical Journal B, 27:257-261.
    [Lotka's law, Pareto's principle]

  48. S Song, KH Zhang (2002), "Urbanisation and city size distribution in China", Urban Studies, 39:2317-2327.

  49. Yutaka Tachimori, Takashi Tahara (2002), "Clinical diagnoses following Zipf's law", Fractals, 10(3):341-351.
    [ abstract]

  50. Ludmila Uhlirova (2002), "Zipf's notion of 'economy' on the text level", Glottometrics, 3:39-60.

  51. Eric S Wheeler (2002), "Zipf´s law and why it works everywhere", Glottometrics, 4:45-48.

  52. YI Wolf, G Karev, EV Koonin (2002), "Scale-free networks in biology: new insights into the fundamentals of evolution", Bioesssays, 24(2):105-109.

  53. Xiangying Yang, Gustavo de Veciana (2002), "On Zipf law and effectiveness of hierarchical caching", Proceedings of Communication Networks and Distributed Systems 2002 (CNDS02), pp.?-?.