Fine-tuning open-source LLMs for Multi-entity Sentiment Analysis task using [mlx-lm]
April 18, 2026 · Project
Multi-entity Sentiment Analysis revisited
As first laid out in this post, Multi-entity Sentiment Analysis has been one of the toughest tasks I have pondered.
However intelligent the models were, most of them failed to extract a separate sentiment per entity.
Take the following sentence for example.
Apple shares shot up thanks to iPhone sales, while its peers struggled with the increased AI spending.
Unlike human readers, who can capture the essence of the text, many sentiment models (and simple bag-of-words approaches) try to determine the overall sentiment of the given text rather than that of a specific entity, so juxtaposed contrasting polarities often confuse them.
(Original) Approaches
The main approach when I first floated the idea of multi-entity sentiment analysis was to simplify and formalize the sentence structure, which might confuse the language model less.
To simplify the sentence, I organized the following structure.
- Entity: (Subject and object)
- Polarity: +/-/0
- Direction of the polarity:
  - Posline (Positive/Positive): Both entities have the same directional polarity (positive)
  - Pos (Positive/Neutral): One of the entities has the positive polarity
  - Over (Positive/Negative): The first entity is positive, while the latter is negative
  - Under (Negative/Positive): The first entity is negative, while the latter is positive
  - Negline (Negative/Negative): Both entities have the same directional polarity (negative)
  - Neg (Negative/Neutral): One of the entities has the negative polarity
- Category:
- Investments: Buy, Sell, IPO, Privatization, Invest, Bid
- Cooperation: Win-win situations (in-tandem)
- Family / Ownership: Same line of business (e.g., Franchise)
- Performance: Stock market performance
- Legal: File, [Sued / Indicted / Subpoenaed / Alleged] (by), Win, Lose (Bidirectional)
- News release: Launch, Patent, Authorization
- Bankruptcy: Entered, Exited
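The direction-of-polarity scheme above can be sketched as a small lookup. This is my own minimal encoding, assuming polarities are written as "+", "-", and "0"; the fallback label for unlisted pairs (e.g., neutral/neutral) is hypothetical.

```python
# Direction labels from the scheme above, keyed by (subject, object) polarity.
# Which of the two entities is neutral does not change the label, so both
# orderings are listed for Pos and Neg (an assumption on my part).
DIRECTION_LABELS = {
    ("+", "+"): "Posline",  # both entities positive
    ("+", "0"): "Pos",      # one entity positive, the other neutral
    ("0", "+"): "Pos",
    ("+", "-"): "Over",     # first positive, second negative
    ("-", "+"): "Under",    # first negative, second positive
    ("-", "-"): "Negline",  # both entities negative
    ("-", "0"): "Neg",      # one entity negative, the other neutral
    ("0", "-"): "Neg",
}

def direction(p1: str, p2: str) -> str:
    """Map a (subject, object) polarity pair to its direction label."""
    return DIRECTION_LABELS.get((p1, p2), "Neutral")
```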
Multi-entity sentiment analysis
As with the coref project, times have changed and LLMs can now help a lot with the most tedious tasks, i.e., generating train/validation datasets.
The above definition came in handy for Claude to refine and generate datasets needed for the training process.
- Entity: (Subject and object)
- Polarity: +/-/0/~
- Category: Legal / Business / Performance / Recruitment / NewsRelease / Bankruptcy
A sample dataset record looks as follows.
{"id": "eval-016", "text": "Apple launched its first generative AI features across iPhone and Mac, positioning Apple Intelligence as a privacy-first alternative to ChatGPT.", "extractions": [{"entity": "Apple", "polarity": "+", "category": "NewsRelease"}, {"entity": "Apple Intelligence", "polarity": "+", "category": "NewsRelease"}, {"entity": "ChatGPT", "polarity": "-", "category": "NewsRelease"}]}
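When generating records like this with an LLM, a quick schema check catches malformed rows before training. A minimal sketch: the field names follow the sample record above, but the validation logic (including the assumption that every extracted entity appears verbatim in the text) is my own.

```python
import json

# Allowed values per the schema above.
VALID_POLARITIES = {"+", "-", "0", "~"}
VALID_CATEGORIES = {"Legal", "Business", "Performance",
                    "Recruitment", "NewsRelease", "Bankruptcy"}

def validate_record(line: str) -> list[dict]:
    """Parse one JSONL line and check each extraction against the schema."""
    record = json.loads(line)
    for ex in record["extractions"]:
        # Assumption: entities are quoted verbatim from the source text.
        assert ex["entity"] in record["text"], f"entity not in text: {ex['entity']}"
        assert ex["polarity"] in VALID_POLARITIES, f"bad polarity: {ex['polarity']}"
        assert ex["category"] in VALID_CATEGORIES, f"bad category: {ex['category']}"
    return record["extractions"]
```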
Fine-tuning open-source LLMs
Given the hardware constraints (MacBook Air M1, 16GB) and the goal of minimizing cloud inference usage, the models considered were similar to those in the coref process.
As before, my initial choice was the Gemma family (both Gemma3 and Gemma4), but unfortunately it didn't live up to my expectations.
Rather surprisingly, the Llama family (Llama 3.1 and Llama 3.2) still outperformed it for my use case. Moreover, Llama 3.2 3B was even better than Llama 3.1 8B, as the summary below shows.
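For context, fine-tuning with mlx-lm runs through its LoRA tooling. A hedged sketch of the kind of invocation involved, not the exact command I ran: flag names have shifted between mlx-lm releases (e.g., `--adapter-path` vs. the older `--adapter-file`), and the `data/` directory (with `train.jsonl`/`valid.jsonl`) and iteration count are placeholders.

```shell
# LoRA fine-tuning sketch with mlx-lm; verify flags against your installed
# version with `python -m mlx_lm.lora --help`.
uv run python -m mlx_lm.lora \
  --model mlx-community/Llama-3.2-3B-Instruct-4bit \
  --train \
  --data data \
  --batch-size 2 \
  --iters 600 \
  --adapter-path adapters
```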
Eval Results (20-sample eval set)
| Metric | Gemma3 4B (Base) | Gemma3 (Fine-tuned) | Llama 3.1 8B (Base) | Llama 3.1 8B (Fine-tuned) | Llama 3.2 3B (Base) | Llama 3.2 3B (Fine-tuned) |
|---|---|---|---|---|---|---|
| Perfect records | 20% | ❌ broken JSON | 40% | 55% | N/A | 65% |
| Entity Precision | 69.1% | ❌ | 70.0% | 86.4% | N/A | 97.4% |
| Entity Recall | 92.7% | ❌ | 85.4% | 92.7% | N/A | 92.7% |
| Entity F1 | 79.2% | ❌ | 76.9% | 89.4% | N/A | 95.0% |
| Polarity Accuracy | 81.6% | ❌ | 68.6% | 81.6% | N/A | 89.5% |
While the sample was small, at least in terms of sentiment polarity for the expected entities, Llama 3.2 3B overshadowed all the other models here.
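For reference, the entity precision/recall/F1 figures above can be computed roughly as below. This is a sketch of my understanding (set overlap between predicted and gold entities per record); the actual eval script may aggregate differently.

```python
def entity_prf(gold: list[str], pred: list[str]) -> tuple[float, float, float]:
    """Precision/recall/F1 over entity mentions for one record."""
    gold_set, pred_set = set(gold), set(pred)
    tp = len(gold_set & pred_set)  # entities both predicted and expected
    precision = tp / len(pred_set) if pred_set else 0.0
    recall = tp / len(gold_set) if gold_set else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1
```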
Inference with the model
Using the locally saved fine-tuned model (adapter)
Using the scripts in the repo, inference can be run as below.
```shell
uv run python sentiment_inf.py --text "Microsoft to acquire Activision Blizzard for $68.7 billion." --model mlx-community/Meta-Llama-3.1-8B-Instruct-4bit --adapter adapters
```

```
Loading model: mlx-community/Meta-Llama-3.1-8B-Instruct-4bit
Fetching 6 files: 100%|███████████████████████████████████████████████████████████████████| 6/6 [00:00<00:00, 26351.65it/s]
Extractions:
[
  {
    "entity": "Microsoft",
    "polarity": "+",
    "category": "Business"
  },
  {
    "entity": "Activision Blizzard",
    "polarity": "+",
    "category": "Business"
  }
]
```
Using the remote model
A fine-tuned model based on Llama-3.2-3B-Instruct is also available on Hugging Face.
```python
from mlx_lm import load, generate

model, tokenizer = load("staedi/sentiment-llama-3.2")

system_prompt = (
    "You are a financial analyst specializing in directed sentiment extraction. "
    "Given a financial news text, identify all mentioned entities and determine "
    "the sentiment directed toward each one. Return your answer as a JSON array "
    "where each element has: \"entity\" (name), \"polarity\" (+ positive, - negative, "
    "0 neutral, ~ context-dependent), and \"category\" (one of: Legal, Business, "
    "Performance, Recruitment, NewsRelease, Bankruptcy).\n\n"
    "Valid polarities: \"+\", \"-\", \"0\", \"~\"\n"
)

text = "Microsoft to acquire Activision Blizzard for $68.7 billion."
user_content = f"Extract the directed financial sentiment from the following text:\n\n{text}"

if tokenizer.chat_template is not None:
    # Render the system/user messages with the model's chat template.
    messages = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_content},
    ]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_dict=False,
    )
else:
    # Fallback for tokenizers without a chat template.
    prompt = f"{system_prompt}\n\n{user_content}"

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```
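The response should be a JSON array, but generated text can carry stray tokens around it (preambles, end-of-turn markers), so a defensive parse is useful. A small sketch; the bracket-slicing heuristic is my own, not something from the repo.

```python
import json

def parse_extractions(response: str) -> list[dict]:
    """Pull the first JSON array out of generated text, tolerating
    stray tokens before and after it."""
    start, end = response.find("["), response.rfind("]")
    if start == -1 or end == -1 or end < start:
        raise ValueError("no JSON array found in model output")
    return json.loads(response[start:end + 1])
```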