ChatGPT applications (NLP perspectives)

3 min read

ChatGPT, again?

I know, I know. As much as I hate to have a say with what everybody is talking about, I'm only going to talk about on the topic, NLP.

ChatGPT and NLP

As you know, ChatGPT or simply GPT for that matter, is based on a (very) large language model. And ChatGPT truly opened the door for many opportunities for greater public (including companies like Microsoft or NVIDIA).

At its core is the resourceful language context and as a NLP practitioner, I've wondered if it really helps me with routine text analytics I had implemented with painstaking preprocessing (NER, News type classification and so on).

The short answer is: Absolutely!

In fact, its outcome was surprisingly good enough to be skeptical of myself for the time I had spent for those tasks.

While not perfect nor free to use (my API credits have expired without me being noticed), ChatGPT clearly satisfies its core colleagues, text practitioners.

Selected applications

Among every possible application (too many if I had to mention all), I found the following relevant to my area of expertise and interest.

Topic Classification (BERTopic)

These days, I use BERTopic most frequently (on Google Colab). With version 0.14.0, BERTopic started to support ChatGPT in its topic representations. By this addition, users can enhance the resultant topics into more sensible sentences.

This is the easiest thinkable application of ChatGPT without much effort. It says it supports ChatGPT and you can just take advantage of it.

Extracting company names (NER)

This is one of the classic NLP examples which require context awareness. The easiest example would be Apple and apple in pretty different contexts.

Apple launched newer version of Airpods a few weeks ago.
An apple is on the table.

I admit that this may be a too simple example. Anyway, the point is that without a context, we aren't so sure about the validity of NER.

This is where ChatGPT shines too. LLMs have been trained with plenty of texts with their contexts, therefore they are able to distinguish better which are companies of interests and others aren't.

NER

Filtering out promotional news

My day job is to analyze news text. Many of the articles I come across are not purely informative or do not serve my purpose.

In Korean articles, though my experience, higher percentage among them are promotional news for specfic companies compared to news in English.

As before, an easy task for human is not for a computer without considering the context.

However, ChatGPT can do the job easier albeit not entirely as impressive as the previous task.

AD

Closing

Currently, I am in the stage of experiment for these tasks leveraging ChatGPT.

While it makes our job easier, one has to keep in mind that due to its high demand and rate limit, processing large volume of data might not be ideal. NER task is especailly burdensome as it usually has to read the whole text to know which to extract.

Even if we can bear the time delay using ChatGPT, how about the bill?

This brilliant tool clearly has a lot of advantage to be taken advantage of, however, how to leverage it in the best way is left to users.

CC BY-NC 4.0 © min park.RSS