Year : 2010  |  Volume : 1  |  Issue : 2  |  Page : 126-127

Data transformation

S Manikandan
Assistant Editor, JPP, Pondicherry, India

Date of Web Publication: 10-Nov-2010

Correspondence Address:
S Manikandan
Department of Pharmacology, Indira Gandhi Medical College and Research Institute, Kadirkamam, Pondicherry

Source of Support: None, Conflict of Interest: None

DOI: 10.4103/0976-500X.72373

How to cite this article:
Manikandan S. Data transformation. J Pharmacol Pharmacother 2010;1:126-7

How to cite this URL:
Manikandan S. Data transformation. J Pharmacol Pharmacother [serial online] 2010 [cited 2022 Jan 28];1:126-7. Available from:

Preparing the data facilitates statistical analysis. Preparation includes checking the data, computing derived data from the original values, statistically adjusting for outliers and transforming the data. The first three methods have been explained previously in this series. [1] Data transformation also forms part of the initial preparation of data before statistical analysis.

When to do Transformation?

The pattern of values obtained when a variable is measured in a large number of individuals is called a distribution. [2] Distributions can be broadly classified as normal and non-normal. The normal distribution is also called the 'Gaussian distribution', after C.F. Gauss. It is called 'normal' because many biological parameters (such as weight, height and blood sugar) approximately follow it. Some biological parameters do not follow a normal distribution, for example antibody titre and the number of episodes of diarrhoea. Beginners should not be confused by the term 'normal': it does not imply clinical normality, and there is nothing abnormal about 'non-normal' distributions.

One of the assumptions of many parametric statistical tests used for hypothesis testing is that the data are samples from a normal distribution. [3] Hence it becomes essential to distinguish skewed distributions from normal ones. There are some simple ways to detect skewness. [4]

  • For measurements that cannot be negative, if the mean is less than twice the standard deviation, then the distribution is likely to be skewed.
  • If the population follows normal distribution, then the mean and the standard deviation of the samples are independent. This fact can be used for detecting skewness. If the standard deviation increases as the mean increases across groups from a population, then it is a skewed distribution.
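The two rules of thumb above can be sketched in a few lines of Python. This is only an illustrative sketch: the function names `likely_skewed` and `group_summaries` and all the data values are hypothetical, invented for the example, and the first rule is applied on the assumption that the measurements cannot be negative.

```python
import statistics

def likely_skewed(values):
    """Rule of thumb for non-negative data: mean < 2 * SD suggests skewness."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation
    return mean < 2 * sd

def group_summaries(groups):
    """Return a list of (mean, SD) pairs, one per group, sorted by mean."""
    return sorted((statistics.mean(g), statistics.stdev(g)) for g in groups.values())

# Hypothetical measurements, invented for illustration only.
titres = [1, 2, 2, 3, 4, 6, 9, 15, 30, 80]
heights_cm = [158, 162, 165, 167, 170, 171, 174, 176, 180, 185]

titres_skewed = likely_skewed(titres)
heights_skewed = likely_skewed(heights_cm)
print("titres likely skewed:", titres_skewed)
print("heights likely skewed:", heights_skewed)

# Second rule: does the SD rise with the mean across groups?
groups = {
    "low": [2, 3, 4, 5, 6],
    "medium": [10, 16, 20, 24, 30],
    "high": [40, 70, 100, 130, 160],
}
summaries = group_summaries(groups)
sd_rises_with_mean = all(a[1] < b[1] for a, b in zip(summaries, summaries[1:]))
print("SD rises with mean across groups:", sd_rises_with_mean)
```

Both checks are screening aids rather than formal tests; a flagged variable still deserves a plot or a formal normality test.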

Apart from these simple methods, normality can be verified by statistical tests such as the Kolmogorov-Smirnov test.

Once skewness is identified, every attempt should be made to convert the data to a normal distribution, so that the more powerful parametric tests can be applied for analysis. This can be accomplished by transformation.

Transformations can also be done for ease of comparison and interpretation. The classical example of a variable that is always reported after logarithmic transformation is the hydrogen ion concentration (pH). Another example where transformation helps in the comparison of data is the logarithmic transformation of the dose in a dose-response curve. When the response is plotted against the dose, the relationship is curvilinear. When the same response is plotted against log dose (log dose-response plot), it gives an elongated S-shaped curve. The middle portion of this curve is approximately a straight line, and comparing two straight lines (by measuring their slopes) is easier than comparing two curves. Hence transformation can assist in the comparison of data.
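A small numerical sketch can illustrate why the log dose-response plot is easier to compare. It assumes a simple hyperbolic model, E = Emax x D / (EC50 + D), which is not taken from this article; the function name `response` and the values of Emax, EC50 and the doses are hypothetical.

```python
import math

def response(dose, emax=100.0, ec50=10.0):
    """Assumed hyperbolic dose-response model: E = Emax * D / (EC50 + D)."""
    return emax * dose / (ec50 + dose)

# Doses that double each time are equally spaced on the log scale.
doses = [2.5, 5.0, 10.0, 20.0, 40.0]
log_doses = [math.log10(d) for d in doses]
responses = [response(d) for d in doses]

# Slope between neighbouring points on the dose scale and on the log-dose scale.
linear_slopes = []
log_slopes = []
for i in range(len(doses) - 1):
    rise = responses[i + 1] - responses[i]
    linear_slopes.append(rise / (doses[i + 1] - doses[i]))
    log_slopes.append(rise / (log_doses[i + 1] - log_doses[i]))

print("slopes against dose:    ", [round(s, 2) for s in linear_slopes])
print("slopes against log dose:", [round(s, 2) for s in log_slopes])
```

Under this assumed model, the slopes against dose keep falling, whereas the slopes against log dose are nearly constant around the middle of the curve, which is the straight portion referred to above.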

In a nutshell, transformation can be carried out to make the data follow a normal distribution or, at times, for ease of interpretation or comparison.

Which Type of Transformation to Use?

Often, the transformation that makes the distribution normal also makes the variances equal. Although there are many transformations, such as logarithm, square root, reciprocal, cube root and square, the first three are the most commonly used. The following are guidelines for selecting a method of transformation. [5]

  • If the standard deviation is proportional to the mean, the distribution is positively skewed and logarithmic transformation is the ideal one.
  • If the variance is proportional to the mean, square root transformation is preferred. This is typical of variables measured as counts, e.g., number of malignant cells in a microscopic field, number of deaths from swine flu, etc.
  • If the standard deviation is proportional to the mean squared, a reciprocal transformation can be performed. Reciprocal transformation is carried out for highly variable quantities such as serum creatinine.
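As a rough illustration of how the three common transformations act on a positively skewed sample, the sketch below computes a simple moment-based skewness coefficient before and after each transformation. The data values and the helper name `skewness` are hypothetical, and the skewness coefficient is used here only as a convenient summary, not as the selection method recommended in the guidelines above.

```python
import math

def skewness(values):
    """Moment-based skewness coefficient: m3 / m2**1.5 (0 for a symmetric sample)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

# Logarithm and reciprocal need strictly positive values; square root needs non-negative values.
transformations = {
    "raw": lambda x: x,
    "logarithm": math.log10,
    "square root": math.sqrt,
    "reciprocal": lambda x: 1.0 / x,
}

# Hypothetical positively skewed measurements, invented for illustration only.
data = [1, 2, 2, 3, 4, 6, 9, 15, 30, 80]

skew_by_transform = {
    name: skewness([transform(x) for x in data])
    for name, transform in transformations.items()
}
for name, value in skew_by_transform.items():
    print(f"{name:12s} skewness = {value:.2f}")
```

A coefficient closer to zero after transformation suggests a more symmetric distribution; which transformation works best depends on the data at hand.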

Among these three transformations, logarithmic transformation is the most commonly used, as its results remain meaningful on back transformation (antilog): the antilog of the mean of the log values is the geometric mean. [3],[6]

A small cautionary note for the beginners performing transformation is that all calculations should be done in the transformed scale and back transformation should be done only at the end.
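The order of operations can be sketched as follows for a logarithmic transformation: the mean and 95% confidence interval are computed on the log scale, and only the final results are back-transformed. The data are hypothetical, and the t value of 2.262 is the two-sided 95% critical value for 9 degrees of freedom, hard-coded here because the sample size is fixed at 10.

```python
import math
import statistics

# Hypothetical positively skewed measurements, invented for illustration only.
data = [1, 2, 2, 3, 4, 6, 9, 15, 30, 80]

# Step 1: transform, then do ALL calculations on the log scale.
logs = [math.log(x) for x in data]
n = len(logs)
mean_log = statistics.mean(logs)
se_log = statistics.stdev(logs) / math.sqrt(n)

T_CRIT = 2.262  # two-sided 95% t value for n - 1 = 9 degrees of freedom
lower_log = mean_log - T_CRIT * se_log
upper_log = mean_log + T_CRIT * se_log

# Step 2: back-transform (antilog) only at the very end.
geometric_mean = math.exp(mean_log)
ci_lower = math.exp(lower_log)
ci_upper = math.exp(upper_log)

print(f"arithmetic mean of raw data: {statistics.mean(data):.2f}")
print(f"geometric mean: {geometric_mean:.2f}")
print(f"95% CI (back-transformed): {ci_lower:.2f} to {ci_upper:.2f}")
```

Note that the back-transformed interval is not symmetric around the geometric mean, which is expected for a log-transformed variable.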

Many researchers think that transformation of data is 'data deceiving'. They can be assured that transformation is a statistically valid and widely accepted method.

How to Report?

While reporting the results, the summary statistics of the raw data should be mentioned. The transformation used should be clearly stated along with the reason for it. One should not forget to mention that all the statistical analyses were carried out on the transformed data. [7] Finally, the back-transformed values (especially for the 95% confidence interval) should also be reported.

References

1. Manikandan S. Preparing to analyse data. J Pharmacol Pharmacother 2010;1:64-5.
2. Altman DG, Bland JM. Statistics notes: The normal distribution. BMJ 1995;310:298.
3. Bland JM, Altman DG. The use of transformation when comparing two means. BMJ 1996;312:1153.
4. Altman DG, Bland JM. Detecting skewness from summary information. BMJ 1996;313:1200.
5. Bland JM, Altman DG. Transforming data. BMJ 1996;312:770.
6. Bland JM, Altman DG. Transformations, means and confidence intervals. BMJ 1996;312:1079.
7. Swinscow TD, Campbell MJ. Statistics at square one. 10th ed. (Indian). New Delhi: Viva Books Private Limited; 2003.


