Linguistic Autopsy

Author Profiling Criteria and Methods

Authors

DOI:

https://doi.org/10.17986/blm.2019356618

Abstract

Objective: Unlike authorship attribution, authorship profiling requires a more elaborate investigation. This article addresses the criteria of authorship profiling and the methods used to predict a suspect's gender, age group, profession, native language, location, demography, personality traits, and mental health.

Materials and Methods: Authorship profiling involves predicting a suspect's gender, age group, profession, native language, location, demography, personality traits, mental health, educational level, ethnicity, cultural level, social status, religious belief, political interests, and cultural values through a thorough investigation of the text he or she produced. For these elements, which we refer to as authorship profiling criteria, several methods have been developed that analyze the author's part-of-speech usage, the readability features of the suspected text, lexical choice, errors, and topic-specific features. These criteria are universal in that they can be applied to any language; however, they may occasionally vary from language to language and from writing system to writing system, depending on language family and typology.
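The kinds of surface features named above (readability and lexical choice in particular) can be illustrated with a minimal Python sketch. It is not the article's method: the function name profile_features, the short English function-word list, and the choice of four features are all assumptions made for illustration. Part-of-speech usage is left out because it would require a language-specific tagger.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny English function-word list (an assumption,
# not taken from the article); real studies use much larger, language-specific lists.
FUNCTION_WORDS = {
    "the", "a", "an", "and", "but", "or", "of", "in", "on", "to",
    "i", "you", "he", "she", "it", "we", "they",
}


def profile_features(text: str) -> dict:
    """Compute a few simple stylometric features for a single text."""
    # Split into sentences on runs of terminal punctuation.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Tokenize into lowercase words made of Unicode letters (optional apostrophe part).
    words = re.findall(r"[^\W\d_]+(?:'[^\W\d_]+)?", text.lower())

    if not words or not sentences:
        return {
            "avg_sentence_length": 0.0,
            "avg_word_length": 0.0,
            "type_token_ratio": 0.0,
            "function_word_ratio": 0.0,
        }

    counts = Counter(words)
    function_hits = sum(counts[w] for w in FUNCTION_WORDS)

    return {
        # Readability proxies: longer sentences and words suggest denser prose.
        "avg_sentence_length": len(words) / len(sentences),
        "avg_word_length": sum(len(w) for w in words) / len(words),
        # Lexical choice proxies: vocabulary variety and function-word share.
        "type_token_ratio": len(counts) / len(words),
        "function_word_ratio": function_hits / len(words),
    }


if __name__ == "__main__":
    sample = "The cat sat on the mat. It was happy!"
    for name, value in profile_features(sample).items():
        print(f"{name}: {value:.3f}")
```

In a profiling setting, feature vectors of this kind would typically be fed to a classifier trained on texts with known author attributes; the features and their weights would have to be chosen per language, in line with the language-dependence noted above.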

Results and Discussion: The article specifically examines the methods used to predict the following authorship profiling elements: gender, age group, profession, native language, location, demography, personality traits, and mental health. For each element, prediction accuracy ranges from 60% to 80%. To increase this rate, forensic linguists need to develop new methods that draw on advances in each subfield of linguistics. Because original case material is considerably difficult to acquire in Turkey, research inevitably has to rely on fictional data and corpora.

Published

2019-12-29

Section

Research Article

How to Cite

Karaman B İlkay. Linguistic Autopsy: Author Profiling Criteria and Methods. Bull Leg Med [Internet]. 2019 Dec. 29 [cited 2025 Sep. 6];24(3):214-25. Available from: https://www.adlitipbulteni.com/index.php/atb/article/view/1342