Working with a linguistic corpus using R: An introductory note with Indonesian Negating Construction
Date created: 2018-07-03 02:05 AM | Last Updated: 2022-08-04 08:30 PM
Identifier: DOI 10.17605/osf.io/s6kh8
Category: Project
Description: The project contains the RMarkdown file and the accompanying dataset for a paper published open access in Linguistik Indonesia, the flagship journal of the Indonesian Linguistic Society. The paper demonstrates the use of R, especially RMarkdown notebooks and RStudio, for a unified data science workflow in corpus linguistics. As a demonstration, it presents a case study on the Indonesian negating construction, based on data from the Indonesian Leipzig Corpora.
The webpage for Linguistik Indonesia can be accessed here.
To download the full paper, click here.
To download the Leipzig corpus file used in the paper (i.e. "ind_newscrawl_2012_1M-sentences.txt" [123MB]), click here. The corpus file must be placed in the same folder as the other data in this project so that the code in the RMarkdown file can access it when run.
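Before running the RMarkdown file, it may help to confirm that the corpus file is where the code expects it. The following is a minimal sketch in R, not taken from the paper's RMarkdown file: the helper name `preview_corpus` is hypothetical, and it assumes the working directory is the project folder that holds the other data.

```r
# Hypothetical helper (not part of the paper's RMarkdown file): check that the
# corpus file is in the working directory and preview its first lines.
corpus_file <- "ind_newscrawl_2012_1M-sentences.txt"

preview_corpus <- function(path = corpus_file, n = 3) {
  if (!file.exists(path)) {
    # Report where R looked, so the file can be moved to the right folder.
    message("Corpus file not found in ", getwd(), ": ", path)
    return(character(0))
  }
  # Read only the first n lines; the full file is about 123 MB.
  readLines(path, n = n, encoding = "UTF-8", warn = FALSE)
}

# Returns the first three lines, or an empty character vector with a
# message if the file is not in the working directory.
preview_corpus()
```

If the call returns an empty vector, move the downloaded file into the project folder (or set the working directory to that folder) and try again.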