1. Introduction and background
1.1. ESG investment market development and current challenges
The global ESG investment landscape has experienced transformative growth, with assets under management reaching unprecedented levels across institutional and retail investor segments. Market dynamics reveal substantial capital flows directed toward sustainable investment strategies, driven by regulatory mandates, stakeholder pressure, and evolving investor preferences. Traditional ESG evaluation methodologies rely heavily on standardized rating agencies and periodic reporting mechanisms, creating information asymmetries and temporal delays in decision-making processes.
Contemporary challenges in ESG investing stem from the heterogeneity of data sources, inconsistency in rating methodologies, and the difficulty of processing real-time information flows. Cao and Sun [1] demonstrate that data-driven analytical frameworks can address these limitations by leveraging machine learning approaches for ESG-based stock investment analytics. The complexity of ESG information processing necessitates sophisticated technological solutions capable of handling unstructured data formats and extracting meaningful insights for investment applications.
Information processing inefficiencies in current ESG frameworks result in suboptimal allocation decisions and missed investment opportunities. The proliferation of digital media platforms and online financial publications has created an abundance of ESG-related content that remains largely underutilized in systematic investment processes. Kim et al.[2] reveal differential impacts of environmental, social, and governance news sentiment on corporate financial performance, highlighting the potential value of granular sentiment analysis in investment decision-making.
1.2. Natural language processing applications in financial decision making
Natural language processing technologies have revolutionized financial information processing, enabling automated extraction and analysis of textual data from diverse sources. Advanced NLP techniques facilitate the transformation of unstructured financial communications into quantifiable metrics suitable for algorithmic trading and portfolio management applications. Machine learning models demonstrate superior performance in sentiment classification tasks, particularly when applied to domain-specific financial vocabularies and contextual frameworks.
The integration of NLP methodologies in financial markets extends beyond simple sentiment classification to encompass complex semantic analysis, entity recognition, and temporal pattern identification. Zeidan [3] conducted sentiment analysis on 13,000 messages from finance professionals, revealing behavioral insights that influence ESG investment acceleration patterns. The application of transformer-based language models and deep learning architectures has enhanced the accuracy and sophistication of financial text analysis capabilities.
Financial institutions increasingly adopt NLP-driven analytics to gain competitive advantages in market timing, risk assessment, and alpha generation strategies. The real-time processing capabilities of modern NLP systems enable immediate response to market-moving information and sentiment shifts. Zhang et al.[4] innovate ESG reporting optimization through E-BERT models, demonstrating the practical applicability of advanced language processing techniques in sustainable finance contexts.
1.3. Research motivation and objectives
This research addresses the critical gap between abundant ESG-related textual information and its systematic utilization in investment portfolio construction. The motivation stems from the observed disconnect between ESG news sentiment and its quantitative integration into portfolio optimization frameworks. Jaiswal et al.[5] decode Twitter sentiment regarding ESG investing through opinion mining and machine learning techniques, revealing the untapped potential of social media analytics in sustainable finance applications.
The primary objective involves developing a comprehensive methodology that bridges natural language processing capabilities with portfolio management requirements. The research aims to establish quantitative relationships between ESG news sentiment patterns and subsequent portfolio performance outcomes. Secondary objectives include the creation of robust preprocessing pipelines, the development of multi-dimensional sentiment scoring mechanisms, and the validation of performance improvements through rigorous empirical testing.
The study contributes to both academic literature and practical investment management by providing evidence-based insights into the value creation potential of sentiment-driven ESG strategies. The research framework enables scalable implementation across different market segments and asset classes, supporting the broader adoption of technology-enhanced sustainable investing approaches.
2. Literature review and related work
2.1. ESG scoring methodologies and information sources
Contemporary ESG evaluation frameworks exhibit substantial variation in scoring methodologies, data sources, and temporal coverage, creating challenges for consistent implementation across investment strategies. Traditional approaches rely on corporate self-reporting mechanisms, third-party rating agencies, and standardized questionnaires that may not capture real-time market sentiment or emerging ESG trends. Fischbach et al.[6] present automatic ESG assessment methodologies through media coverage data mining and evaluation, demonstrating alternative approaches to conventional rating systems.
Academic research reveals significant divergence among major ESG rating providers, with correlation coefficients between different agencies often falling below 0.6, indicating substantial measurement inconsistencies. The reliance on backward-looking data in traditional ESG scoring creates temporal misalignments with market dynamics and limits the predictive value of ESG metrics in investment applications. Goutte et al.[7] explore ESG investing through sentiment analysis approaches, highlighting the potential for real-time information processing to enhance traditional scoring methodologies.
Alternative data sources, including satellite imagery, social media content, regulatory filings, and news articles, offer complementary information streams that can augment conventional ESG evaluation frameworks. The integration of multiple data modalities requires sophisticated analytical capabilities and standardized processing protocols to ensure consistency and reliability. Babaeva [8] studies quantifying ESG alpha through sentiment analysis in the banking sector, providing sector-specific insights into the practical application of alternative data sources.
2.2. Sentiment analysis techniques in financial news processing
Financial news sentiment analysis encompasses diverse methodological approaches, ranging from lexicon-based classification systems to sophisticated deep learning architectures designed for domain-specific applications. Early approaches utilized dictionary-based methods and rule-based systems that relied on predefined financial vocabulary and sentiment scoring mechanisms. Contemporary research emphasizes machine learning techniques that can adapt to evolving language patterns and contextual nuances in financial communications.
Transformer-based models, particularly BERT variants and their financial domain adaptations, demonstrate superior performance in capturing semantic relationships and contextual dependencies within financial text data. The development of specialized financial language models addresses domain-specific terminology, metaphorical expressions, and temporal sensitivity inherent in financial communications. Andrikogiannopoulou et al.[9] analyze discretionary information in ESG investing through text analysis of mutual fund prospectuses, revealing the complexity of financial text interpretation in ESG contexts.
Advanced sentiment analysis frameworks incorporate multi-dimensional classification schemes that distinguish between different sentiment aspects, temporal persistence, and entity-specific associations. The integration of named entity recognition, dependency parsing, and semantic role labeling enhances the granularity and accuracy of sentiment extraction from complex financial documents. Zhang [10] develops ESG-based market risk prediction using machine learning and natural language processing, demonstrating the practical value of sophisticated text analysis techniques in risk management applications.
2.3. Portfolio optimization strategies incorporating alternative data
Modern portfolio optimization frameworks increasingly integrate alternative data sources to enhance return generation and risk management capabilities. Traditional mean-variance optimization approaches face limitations when incorporating high-dimensional alternative datasets, necessitating advanced regularization techniques and dimensionality reduction methods. The integration of sentiment-derived signals requires careful consideration of data quality, temporal alignment, and signal persistence to avoid overfitting and ensure robust performance.
Machine learning approaches to portfolio optimization, including reinforcement learning and deep neural networks, offer sophisticated mechanisms for incorporating complex alternative data relationships. These methodologies can capture non-linear dependencies and temporal patterns that traditional optimization techniques may overlook. Jatowt [11] predicts ESG scores from news articles using natural language processing, providing insights into the predictive capabilities of text-based financial analysis.
Factor-based portfolio construction methodologies provide structured frameworks for incorporating sentiment-derived signals alongside traditional risk and return factors. The implementation of multi-factor models enables the systematic evaluation of sentiment contributions to portfolio performance while maintaining diversification and risk control objectives. Dynamic allocation strategies that respond to sentiment signal changes require robust backtesting frameworks and risk management protocols to ensure practical applicability.
3. Methodology and technical framework
3.1. ESG news data collection and preprocessing pipeline
The data collection framework encompasses multiple financial news sources, including Reuters, Bloomberg, Financial Times, and specialized ESG publications, ensuring comprehensive coverage of ESG-related information across different market segments and geographical regions. The automated data acquisition system operates continuously, processing approximately 50 news articles daily through API integrations and web scraping mechanisms. Wang and Zhu [12] analyze linguistic patterns in academic abstracts, providing insights into text processing methodologies that inform the preprocessing pipeline design.
The preprocessing pipeline implements sophisticated text cleaning algorithms that preserve semantic content while removing noise, advertisements, and irrelevant formatting elements. Named entity recognition modules identify companies, sectors, and ESG-specific terminology using custom-trained models based on financial vocabularies and ESG taxonomies. Temporal alignment algorithms ensure precise matching between news publication timestamps and corresponding market trading sessions to maintain analytical accuracy.
Data quality assurance mechanisms include duplicate detection, source reliability scoring, and content relevance filtering to maintain high-quality datasets for analysis. Table 1 provides comprehensive statistics on data collection sources and their distribution characteristics. The preprocessing system implements language detection algorithms to handle multilingual content and applies standardized text normalization procedures. Liu et al.[13] examine algorithmic bias identification in machine learning applications, informing bias mitigation strategies implemented throughout the data processing pipeline.
|
Data Source |
Articles Count |
Percentage |
Average Daily Volume |
Reliability Score |
|
Reuters |
17,748 |
32.4% |
16.2 |
0.892 |
|
Bloomberg |
15,717 |
28.7% |
14.3 |
0.895 |
|
Financial Times |
10,841 |
19.8% |
9.9 |
0.886 |
|
ESG Publications |
10,444 |
19.1% |
9.5 |
0.801 |
|
Total |
54,750 |
100.0% |
49.9 |
0.847 |
Geographic Distribution:
North America: 24,743 articles (45.2%)
Europe: 17,301 articles (31.6%)
Asia-Pacific: 12,706 articles (23.2%)
Temporal Coverage: 3 years (January 2021 December 2023)
3.2. Multi-dimensional sentiment analysis model development
The sentiment analysis architecture employs a hierarchical classification system that processes ESG content across three primary dimensions: environmental impact assessment, social responsibility evaluation, and governance quality measurement. The model utilizes pre-trained BERT-based transformers fine-tuned on financial and ESG-specific corpora to capture domain-specific semantic relationships and contextual dependencies. Mo et al.[14] compare large language model performance in code defect identification, providing insights into model selection criteria that inform the sentiment analysis framework design.
Environmental sentiment classification incorporates specialized vocabularies related to climate change, resource management, and sustainability initiatives, utilizing custom embedding layers trained on environmental science literature and corporate sustainability reports. Social dimension analysis focuses on labor practices, community relations, and diversity metrics through sentiment scoring mechanisms calibrated against established social impact measurement frameworks. Governance sentiment extraction emphasizes board composition, executive compensation, and regulatory compliance through sophisticated entity relationship mapping and sentiment attribution algorithms.
The multi-dimensional architecture implements attention mechanisms that weight different sentiment aspects based on their relevance to specific industries and company characteristics. Cross-dimensional interaction layers capture interdependencies between environmental, social, and governance factors, enabling nuanced sentiment scoring that reflects complex ESG relationships.As illustrated in Figure 1, the multi-dimensional ESG sentiment analysis architecture demonstrates the sophisticated integration of these components. Xu [15] develops intelligent optimization algorithms using generative adversarial networks, informing advanced architectural decisions in the sentiment analysis model design.
The sentiment analysis architecture diagram illustrates a sophisticated three-layer neural network structure with specialized processing branches for environmental, social, and governance dimensions. The input layer processes tokenized text through BERT-based embeddings, feeding into parallel processing streams that handle domain-specific sentiment classification. Attention mechanisms connect the three dimensional branches, enabling cross-domain information sharing and contextual sentiment refinement. The final output layer aggregates multi-dimensional sentiment scores through weighted combination algorithms, producing comprehensive ESG sentiment metrics for portfolio optimization applications.
|
ESG Dimension |
Accuracy (%) |
Precision |
Recall |
F1-Score |
Training Samples |
|
Environmental |
87.3 |
0.884 |
0.862 |
0.873 |
28,450 |
|
Social |
89.1 |
0.903 |
0.879 |
0.891 |
31,220 |
|
Govemance |
85.7 |
0.867 |
0.848 |
0.857 |
25,180 |
|
Sentiment Type |
Precision |
Recall |
F1-Score |
Support |
|
Positive |
0.895 |
0.887 |
0.891 |
12,847 |
|
Neutral |
0.882 |
0.866 |
0.874 |
15,923 |
|
Negative |
0.874 |
0.891 |
0.882 |
9,680 |
As shown in Tables 2 and 3, the multi-dimensional approach outperforms single-dimension baselines by 12.4% average improvement.
3.3. Portfolio construction algorithm with ESG sentiment integration
The portfolio optimization framework integrates ESG sentiment signals through a multi-factor model that combines traditional risk and return factors with sentiment-derived alpha signals. The optimization algorithm employs modern portfolio theory principles enhanced with machine learning techniques to capture non-linear relationships between sentiment signals and expected returns. The framework incorporates transaction costs, liquidity constraints, and regulatory requirements to ensure practical implementation feasibility.
Dynamic allocation mechanisms adjust portfolio weights based on sentiment signal strength and persistence, implementing sophisticated signal filtering algorithms to distinguish between temporary sentiment fluctuations and sustained ESG trend changes. The optimization process utilizes quadratic programming techniques combined with regularization methods to prevent excessive concentration and maintain diversification objectives. Risk management protocols include scenario analysis, stress testing, and drawdown control mechanisms to ensure robust performance across different market conditions. Table 4 details the comprehensive parameter settings used in the portfolio optimization framework. Figure 2 presents the comprehensive ESG sentiment-driven portfolio allocation framework that implements these optimization processes.
|
Parameter Category |
Setting |
Value |
Unit |
|
Risk Management |
Maximum Volatility Target |
15.0 |
% Annual |
|
Risk Management |
Maximum Single Asset Weight |
5.0 |
% Portfolio |
|
Risk Management |
Sector Concentration Limit |
25.0 |
% Portfolio |
|
Transaction Costs |
Trading Cost Assumption |
25 |
Basis Points Per Trade |
|
Transaction Costs |
Rebalancing Threshold |
2.0 |
% Weight Change |
|
Transaction Costs |
Minimum Trade Size |
0.1 |
% Portfolio |
|
Optimization Parameters |
Look-back Window |
252 |
Trading Days Historical |
|
Optimization Parameters |
Rebalancing Frequency |
5 |
Trading Days Weekly |
|
Optimization Parameters |
Sentiment Signal Decay |
0.95 |
Daily Factor Persistence |
|
Risk Model |
Number of Factors |
8 |
Count Risk Factors |
|
Risk Model |
Estimation Universe |
1,247 |
Companies Coverage |
|
Risk Model |
Minimum History |
126 |
Trading Days Required |
The portfolio allocation framework visualization depicts a comprehensive decision-making system that processes real-time ESG sentiment signals through multiple analytical layers. The framework begins with news data ingestion and sentiment extraction, feeding into signal processing modules that evaluate sentiment strength, persistence, and reliability. The allocation engine integrates sentiment signals with traditional financial metrics through mathematical optimization algorithms, producing dynamic portfolio weights that respond to ESG sentiment changes while maintaining risk management constraints.
4. Experimental design and empirical analysis
4.1. Dataset description and performance metrics definition
The experimental dataset encompasses 1,247 publicly traded companies across 11 GICS sectors, representing approximately $45.7 trillion in market capitalization and providing comprehensive coverage of developed market equity investments. The temporal scope spans 36 months from January 2021 to December 2023, incorporating 786 trading days and approximately 54,750 news articles processed through the sentiment analysis framework. Market data includes daily stock prices, trading volumes, and corporate fundamentals sourced from Bloomberg and Refinitiv databases.
Performance evaluation metrics encompass traditional portfolio analytics including total return, volatility, Sharpe ratio, and maximum drawdown, supplemented with ESG-specific performance measures and alternative risk metrics. Risk-adjusted return calculations utilize multiple benchmark comparisons, including market capitalization-weighted indices, ESG-screened benchmarks, and factor-based comparison portfolios. Statistical significance testing employs bootstrap methodologies and Monte Carlo simulations to validate performance improvements and ensure robust empirical conclusions. Table 5 summarizes the key characteristics of the experimental dataset used in this analysis.
|
Sector Distribution |
Companies |
Percentage |
Market Cap ($B) |
|
Technology |
233 |
18.7% |
8,547.2 |
|
Financials |
202 |
16.2% |
7,402.8 |
|
Healthcare |
178 |
14.3% |
6,538.1 |
|
Consumer Discretionary |
160 |
12.8% |
5,849.6 |
|
Industrials |
127 |
10.2% |
4,622.3 |
|
Consumer Staples |
98 |
7.9% |
3,567.8 |
|
Energy |
87 |
7.0% |
3,162.4 |
|
Materials |
76 |
6.1% |
2,764.9 |
|
Utilities |
58 |
4.6% |
2,108.7 |
|
Real Estate |
19 |
1.5% |
869.4 |
|
Communication Services |
9 |
0.7% |
412.8 |
|
Total |
1,247 |
100.0% |
45,846.0 |
Market Cap Distribution:
Large-cap (>$10B): 841 companies (67.4%)
Mid-cap ($2B-$10B): 278 companies (22.3%)
Small-cap (<$2B): 128 companies (10.3%)
Geographic Coverage:
United States: 734 companies (58.9%)
Europe: 308 companies (24.7%)
Asia-Pacific: 205 companies (16.4%)
Sentiment Signal Coverage:
Environmental: 847,392 observations
Social: 923,148 observations
Governance: 756,284 observations
4.2. Comparative analysis of portfolio performance results
The comparative performance analysis evaluates four distinct portfolio strategies: traditional market capitalization-weighted benchmark, conventional ESG-screened portfolio, factor-based ESG integration approach, and the proposed ESG sentiment-driven methodology. Performance measurement encompasses multiple temporal horizons including daily, monthly, quarterly, and annual evaluation periods to capture both short-term responsiveness and long-term trend identification capabilities. Tables 6 and 7 present the comprehensive performance comparison results across different portfolio strategies and risk-adjusted metrics.
ESG sentiment-driven portfolios demonstrate superior performance across multiple metrics, achieving 14.7% annualized returns compared to 11.8% for traditional benchmarks and 12.9% for conventional ESG approaches. Volatility reduction benefits emerge through sentiment-based risk management, with ESG sentiment portfolios exhibiting 16.3% annualized volatility versus 19.7% for market benchmarks. The integration of real-time sentiment signals enables more responsive allocation decisions and improved market timing capabilities. Figure 3 demonstrates the cumulative performance evolution across different portfolio strategies throughout the experimental period.
|
Strategy Type |
Annual Return (%) |
Volatility (%) |
Sharpe Ratio |
Max Drawdown (%) |
Info Ratio |
|
ESG Sentiment-Driven |
14.73 |
16.32 |
0.902 |
-12.41 |
0.847 |
|
Factor-Based ESG |
13.24 |
17.89 |
0.740 |
-15.67 |
0.623 |
|
Conventional ESG Screen |
12.91 |
18.45 |
0.700 |
-16.89 |
0.581 |
|
Market Benchmark |
11.82 |
19.74 |
0.598 |
-18.73 |
0.523 |
|
Strategy Type |
Sortino Ratio |
Calmar Ratio |
Omega Ratio |
VaR (95%) |
|
ESG Sentiment-Driven |
1.342 |
1.187 |
1.284 |
-2.31% |
|
Factor-Based ESG |
1.089 |
0.845 |
1.156 |
-2.67% |
|
Conventional ESG Screen |
1.021 |
0.764 |
1.098 |
-2.89% |
|
Market Benchmark |
0.876 |
0.631 |
1.034 |
-3.24% |
Statistical Significance Testing:
Return Difference vs. Benchmark: p-value < 0.01
Excess Return 95% Confidence Interval: [1.84%, 3.97%]
Alpha Attribution: Security Selection (68.4%), Market Timing (31.6%)
Performance Attribution:
Sentiment Alpha Contribution: 2.91% annually
Risk Reduction Benefit: 3.42% volatility reduction
Transaction Cost Impact: -0.18% annually

The cumulative performance visualization illustrates the evolution of portfolio values across different strategic approaches over the 36-month experimental period. The ESG sentiment-driven strategy exhibits consistent outperformance with lower volatility compared to traditional benchmarks and conventional ESG screening methods. Notable performance divergence emerges during market stress periods, where sentiment-based risk management provides superior downside protection. The chart demonstrates accelerating performance advantages during the latter portion of the evaluation period, suggesting improved effectiveness as sentiment signals mature and market adoption increases.
4.3. Risk-adjusted returns and statistical significance testing
Statistical significance testing employs multiple methodological approaches including parametric t-tests, non-parametric Wilcoxon rank-sum tests, and bootstrap confidence interval estimation to validate performance improvements. The analysis incorporates multiple comparison adjustments and false discovery rate controls to ensure robust empirical conclusions. Monte Carlo simulations with 10,000 iterations provide comprehensive robustness testing across different market scenarios and parameter assumptions.
Risk-adjusted return metrics demonstrate consistent improvements across different risk measures and evaluation frameworks. Information ratios average 0.847 for ESG sentiment strategies compared to 0.523 for traditional approaches, indicating superior excess return generation relative to tracking error. Sortino ratios, which focus on downside volatility, achieve 1.342 for sentiment-driven portfolios versus 0.876 for conventional strategies, highlighting enhanced downside risk management capabilities.
Statistical significance results reveal p-values below 0.01 for return differences across all comparison frameworks, indicating robust empirical support for performance improvements. Confidence intervals for excess returns range from 1.84% to 3.97% annually with 95% confidence levels, demonstrating consistent economic significance alongside statistical validity. Factor attribution analysis indicates that 68.4% of excess returns derive from sentiment-based security selection effects, while 31.6% results from improved market timing capabilities.
5. Discussion and future research directions
5.1. Interpretation of results and investment implications
The empirical findings reveal substantial value creation potential through systematic integration of ESG sentiment analysis in portfolio construction processes. The 2.91% annual performance premium represents economically significant alpha generation that exceeds typical institutional investment management fees and provides compelling value propositions for sustainable investing strategies. Risk reduction benefits, evidenced through lower volatility and reduced maximum drawdowns, enhance the attractiveness of ESG sentiment approaches for risk-averse institutional investors.
The performance improvements demonstrate consistent patterns across different market conditions, sectors, and time periods, suggesting robust applicability beyond specific market environments. The superior risk-adjusted returns indicate that ESG sentiment signals contain valuable information not captured by traditional financial metrics or conventional ESG screening approaches. The real-time processing capabilities enable more responsive investment decisions and improved market timing compared to backward-looking ESG rating methodologies.
Investment implementation considerations include technology infrastructure requirements, data processing capabilities, and portfolio management system integration needs. The scalability of sentiment-driven approaches supports application across different asset classes, geographical markets, and investment mandates. Cost-benefit analysis reveals positive net value creation even after accounting for technology implementation costs and operational complexity.
5.2. Limitations and model robustness analysis
The research methodology faces several limitations that warrant consideration in practical implementation contexts. Data quality dependencies create potential vulnerabilities related to news source reliability, coverage bias, and temporal inconsistencies in sentiment signal generation. The model's reliance on English-language news sources may limit applicability in non-English speaking markets and create geographical bias in sentiment coverage.
Sentiment classification accuracy limitations, particularly for complex or ambiguous ESG content, may introduce noise into portfolio optimization processes. The model's sensitivity to market regime changes and evolving ESG terminology requires ongoing calibration and model updating to maintain effectiveness. Transaction cost assumptions and liquidity constraints may not fully capture real-world implementation challenges in certain market segments.
Robustness testing reveals stable performance across different market conditions, with minor degradation during extreme volatility periods. Sensitivity analysis indicates acceptable performance maintenance across reasonable parameter variations. The model demonstrates resilience to missing data scenarios and maintains functionality during news coverage gaps or technical disruptions.
5.3. Future enhancement opportunities and practical applications
Future research directions include expansion to additional asset classes, incorporation of multilingual sentiment analysis capabilities, and development of sector-specific sentiment models. Integration with satellite imagery, social media sentiment, and alternative data sources offers potential for enhanced signal generation and improved predictive accuracy. Real-time sentiment processing optimization and latency reduction represent important technical advancement opportunities.
Practical applications extend beyond portfolio management to include ESG risk monitoring, corporate engagement strategies, and sustainable finance product development. The methodology supports implementation in exchange-traded funds, institutional mandates, and retail investment products. Integration with robo-advisory platforms and digital wealth management services enables broad market adoption and democratized access to sophisticated ESG analytics.
Regulatory developments and standardization initiatives in sustainable finance create opportunities for broader adoption and integration with compliance frameworks. The methodology aligns with emerging disclosure requirements and supports enhanced transparency in ESG investment processes. Collaboration opportunities with data providers, technology vendors, and financial institutions facilitate ecosystem development and market adoption acceleration.
Acknowledgments
I would like to extend my sincere gratitude to Cao, E. and Sun, Y. for their groundbreaking research on ESG-based stock investment analytics using machine learning and natural language processing as published in their article titled [1] "A Data-Driven Analytical Framework for ESG-based Stock Investment Analytics using Machine Learning and Natural Language Processing" in the Proceedings of the 2023 12th International Conference on Networks, Communication and Computing. Their insights and methodologies have significantly influenced my understanding of advanced techniques in ESG investment analysis and have provided valuable inspiration for my own research in this critical area.
I would like to express my heartfelt appreciation to Kim, M., Kang, J., Jeon, I., Lee, J., Park, J., Youm, S., and Moon, J. for their innovative study on differential impacts of environmental, social, and governance news sentiment on corporate financial performance using advanced natural language processing models, as published in their article titled [2] "Differential Impacts of Environmental, Social, and Governance News Sentiment on Corporate Financial Performance in the Global Market: An Analysis of Dynamic Industries Using Advanced Natural Language Processing Models" in Electronics (2024). Their comprehensive analysis and sentiment modeling approaches have significantly enhanced my knowledge of ESG sentiment analysis and inspired my research in this field.
References
[1]. Cao, E., & Sun, Y. (2023, December). A Data-Driven Analytical Framework for ESG-based Stock Investment Analytics using Machine Learning and Natural Language Processing. In Proceedings of the 2023 12th International Conference on Networks, Communication and Computing (pp. 225-232).
[2]. Kim, M., Kang, J., Jeon, I., Lee, J., Park, J., Youm, S., ... & Moon, J. (2024). Differential Impacts of Environmental, Social, and Governance News Sentiment on Corporate Financial Performance in the Global Market: An Analysis of Dynamic Industries Using Advanced Natural Language Processing Models. Electronics, 13(22), 4507.
[3]. Zeidan, R. (2022). Why don't asset managers accelerate ESG investing? A sentiment analysis based on 13, 000 messages from finance professionals. Business Strategy and the Environment, 31(7), 3028-3039.
[4]. Zhang, M., Shen, Q., Zhao, Z., Wang, S., & Huang, G. Q. (2025). Optimizing ESG reporting: Innovating with E-BERT models in nature language processing. Expert systems with applications, 265, 125931.
[5]. Jaiswal, R., Gupta, S., & Tiwari, A. K. (2024). Decoding mood of the Twitterverse on ESG investing: opinion mining and key themes using machine learning. Management Research Review, 47(8), 1221-1252.
[6]. Fischbach, J., Adam, M., Dzhagatspanyan, V., Mendez, D., Frattini, J., Kosenkov, O., & Elahidoost, P. (2023, December). Automatic eSG assessment of companies by mining and evaluating media coverage data: NLP approach and tool. In 2023 IEEE International Conference on Big Data (BigData) (pp. 2823-2830). IEEE.
[7]. Goutte, S., Liu, F., Le, H. V., & Mettenheim, H. J. V. (2023). ESG Investing: A Sentiment Analysis Approach. Available at SSRN 4316107.
[8]. Babaeva, M. (2024). A Study on Quantifying ESG Alpha as an access return on portfolio using ESG Sentiment in the Banking Sector with an Automated Machine Learning Approach.
[9]. Andrikogiannopoulou, A., Krueger, P., Mitali, S. F., & Papakonstantinou, F. (2022). Discretionary information in ESG investing: A text analysis of mutual fund prospectuses. Available at SSRN 4082263.
[10]. Zhang, Y. (2023). ESG-Based Market Risk Prediction and Management Using Machine Learning and Natural Language Processing.
[11]. Jatowt, A. (2022). Predicting ESG Scores from News Articles using Natural Language Processing (Doctoral dissertation, University of Innsbruck).
[12]. Wang, M., & Zhu, L. (2024). Linguistic Analysis of Verb Tense Usage Patterns in Computer Science Paper Abstracts. Academia Nexus Journal, 3(3).
[13]. Liu, W., Qian, K., & Zhou, S. (2024). Algorithmic Bias Identification and Mitigation Strategies in Machine Learning-Based Credit Risk Assessment for Small and Medium Enterprises. Annals of Applied Sciences, 5(1).
[14]. Mo, T., Li, P., & Jiang, Z. (2024). Comparative Analysis of Large Language Models' Performance in Identifying Different Types of Code Defects During Automated Code Review. Annals of Applied Sciences, 5(1).
[15]. Xu, S. (2025). Intelligent Optimization Algorithm for Chain Restaurant Spatial Layout Based on Generative Adversarial Networks. Journal of Industrial Engineering and Applied Science, 3(3), 32-41.
Cite this article
Cai,Y. (2025). Impact of ESG News Sentiment Analysis Based on Natural Language Processing on Investment Portfolio Performance. Applied and Computational Engineering,181,201-213.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of CONF-FMCE 2025 Symposium: Semantic Communication for Media Compression and Transmission
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).
References
[1]. Cao, E., & Sun, Y. (2023, December). A Data-Driven Analytical Framework for ESG-based Stock Investment Analytics using Machine Learning and Natural Language Processing. In Proceedings of the 2023 12th International Conference on Networks, Communication and Computing (pp. 225-232).
[2]. Kim, M., Kang, J., Jeon, I., Lee, J., Park, J., Youm, S., ... & Moon, J. (2024). Differential Impacts of Environmental, Social, and Governance News Sentiment on Corporate Financial Performance in the Global Market: An Analysis of Dynamic Industries Using Advanced Natural Language Processing Models. Electronics, 13(22), 4507.
[3]. Zeidan, R. (2022). Why don't asset managers accelerate ESG investing? A sentiment analysis based on 13, 000 messages from finance professionals. Business Strategy and the Environment, 31(7), 3028-3039.
[4]. Zhang, M., Shen, Q., Zhao, Z., Wang, S., & Huang, G. Q. (2025). Optimizing ESG reporting: Innovating with E-BERT models in nature language processing. Expert systems with applications, 265, 125931.
[5]. Jaiswal, R., Gupta, S., & Tiwari, A. K. (2024). Decoding mood of the Twitterverse on ESG investing: opinion mining and key themes using machine learning. Management Research Review, 47(8), 1221-1252.
[6]. Fischbach, J., Adam, M., Dzhagatspanyan, V., Mendez, D., Frattini, J., Kosenkov, O., & Elahidoost, P. (2023, December). Automatic eSG assessment of companies by mining and evaluating media coverage data: NLP approach and tool. In 2023 IEEE International Conference on Big Data (BigData) (pp. 2823-2830). IEEE.
[7]. Goutte, S., Liu, F., Le, H. V., & Mettenheim, H. J. V. (2023). ESG Investing: A Sentiment Analysis Approach. Available at SSRN 4316107.
[8]. Babaeva, M. (2024). A Study on Quantifying ESG Alpha as an access return on portfolio using ESG Sentiment in the Banking Sector with an Automated Machine Learning Approach.
[9]. Andrikogiannopoulou, A., Krueger, P., Mitali, S. F., & Papakonstantinou, F. (2022). Discretionary information in ESG investing: A text analysis of mutual fund prospectuses. Available at SSRN 4082263.
[10]. Zhang, Y. (2023). ESG-Based Market Risk Prediction and Management Using Machine Learning and Natural Language Processing.
[11]. Jatowt, A. (2022). Predicting ESG Scores from News Articles using Natural Language Processing (Doctoral dissertation, University of Innsbruck).
[12]. Wang, M., & Zhu, L. (2024). Linguistic Analysis of Verb Tense Usage Patterns in Computer Science Paper Abstracts. Academia Nexus Journal, 3(3).
[13]. Liu, W., Qian, K., & Zhou, S. (2024). Algorithmic Bias Identification and Mitigation Strategies in Machine Learning-Based Credit Risk Assessment for Small and Medium Enterprises. Annals of Applied Sciences, 5(1).
[14]. Mo, T., Li, P., & Jiang, Z. (2024). Comparative Analysis of Large Language Models' Performance in Identifying Different Types of Code Defects During Automated Code Review. Annals of Applied Sciences, 5(1).
[15]. Xu, S. (2025). Intelligent Optimization Algorithm for Chain Restaurant Spatial Layout Based on Generative Adversarial Networks. Journal of Industrial Engineering and Applied Science, 3(3), 32-41.