A human brain emerging from a smartphone with interconnected digital icons and symbols surrounding it, illustrating technology and information integration in a digital, futuristic environment.

Behind the Screens: Using AI to Dissect Social Media Persuasion

2 min read

Apr 08, 2024

AI's role in comprehending the complexities of human communication on social media presents unparalleled prospects for behavioral research. Our investigation focuses on how Large Language Models (LLMs) can transform our ability to recognize and analyze the persuasive strategies used by financial influencers across several platforms.

This approach to AI-driven insights demonstrates not only the integration of technology and human behavior but also the specific methodology we used to extract and classify massive volumes of content. By concentrating on the particular use of LLMs in social media listening, we highlight its crucial significance in providing a new viewpoint on how digital communication influences consumer perceptions.

The Objective

We had the challenge of understanding the impact social media has on retail investors. A part of our research was aimed at analyzing the types of persuasive techniques influencers employ to encourage their audience to make a particular financial decision. The study focused on influencers across social media platforms including YouTube, X, TikTok, Instagram, and Reddit. This type of work usually follows two main steps:

Extract the available social media content and clean the data to be analyzed.
Use traditional natural language processing techniques and algorithms to classify the data and extract insights.

Large Language Models (LLMs), such as ChatGPT and LLaMA, are algorithms that excel in classifying and analyzing text—which, in this case, are the millions of words we gathered from social media platforms. LLMs can aid in both of these steps: cleaning and classifying the data.

Cleaning the data

In the initial data cleaning phase, LLMs can scan through the whole dataset to pinpoint social media posts that pertain to financial advice. Traditional cleaning techniques for this type of work might revolve around matching the text to specific keywords referencing financial advice. If a post contained one or multiple words from a preselected group of keywords, we would classify such posts as containing financial advice content. One of the benefits of LLMs is they move away from preselection biases and classify the posts based on the prompt that is indicated. We opted not to utilize this specific approach in our study because we sourced our data from specific financial influencers selected for their popularity with the intent of assessing their broader communication patterns.

It is possible to feed a dataset of text to an LLM to indicate if the text contains what could be described as financial advice. Given the technical limitations of the current architecture of LLMs, we have to feed the AI with a social media post from the database and the classification prompt one by one to be classified. If done manually, this process would take a significant amount of time. Instead, we can automate this process by connecting to the LLM Application Programming Interface (API), an interface that enables us to interact directly with AI models. This connection allows for the automatic submission and labeling of social media posts, speeding up the study by processing large amounts of data quickly and effectively.

This process allows for a better cleaning phase since context is crucial for text classification. Modern LLMs based on a specific neural network architecture called transformers are built to consider the context of the entire text.¹ For example, if we use “invest” as a keyword, we could classify the following two statements as financial advice:

“Invest in crypto”
”Musicians invest emotion”

LLMs can take the context of the entire text and classify it correctly: “Invest in crypto” as financial advice and ”Musicians invest emotion” not as financial advice.

This process is not exempt from errors since we have just introduced a possible data bias by relying on the AI’s training data to classify for us. Such training data could be disproportionately inclined toward certain classifications. An example of this bias could occur if the LLM was disproportionately trained on a tech-centric dataset. The LLM might be more adept at classifying texts related to technology investments as financial advice while also misclassifying statements about investments in other sectors, such as agriculture or education.

This is a known limitation and AI companies are constantly trying to minimize this risk.

Classifying the data

LLM application is also useful in the second part of natural language processing: classifying data and extracting insights. The behavioral application for this particular case is that we can prompt the LLM to identify and classify specific persuasive communication techniques within the data. Using AI in this manner, we can loop over the millions of words containing financial advice from social media and quantify the persuasive techniques such as authority, scarcity, and social proof used by influencers.

We apply the same methodology to classify the construal of social media advice. Construal refers to the level of abstraction or specificity with which information is presented in a message. Low construal refers to thinking that focuses on concrete, specific details and immediate actions. Meanwhile, high construal involves abstract, general principles and long-term goals. Here are two hypothetical examples to help you get a better idea of construal in this context:

Low-construal: "Start by setting aside $200 each month into a low-cost index fund. Use automatic transfers from your checking to your investment account on the first of each month to ensure consistency."
High-construal: "Focus on building a diverse investment portfolio to secure your financial future. Prioritize investments that align with your long-term financial goals and risk tolerance."

Low-construal messages on social media are viewed as more credible, reducing the perceived riskiness of the information within, resulting in an increased potential for persuasion.² The findings of our research are yet to be publicly available.

Final Considerations

Some limitations of this methodology are the training biases of LLMs as previously mentioned, as well as the necessity of connecting to an AI model available through an API. This means that you must share your data with a third party—and in some cases, when the text to be analyzed contains sensitive information, this approach is not viable. It is also important to mention that each call to the API will have a small cost depending on the length of the text and the model being used. The cost varies depending on the provider (in our case, the cost per post classified was a fraction of a cent) but can start to be significant with bigger datasets and large numbers of classifications.

In summary, we found a way for AI to classify social media posts based on the persuasive techniques used by influencers and the level of construal in a more streamlined and precise manner than traditional methodologies. This allowed us to analyze behavior at a great scale within a digital environment. The specific findings from the mentioned study will be shared later this year with the complete report published by one of our clients.

This is just one application of how AI can be used to solve challenges using behavioral science. If you are interested, reach out to discuss how we can tailor these strategies to achieve success in your field.

Case studies

From Insight to Impact: Our Success Stories

See Case Studies

Is there a problem we can help with?

See how we work

References

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS'17).
Balaji, M. S., Jiang, Y., & Jha, S. (2021). Nanoinfluencer marketing: How message features affect credibility and behavioral intentions. Journal of Business Research, 136, 293-304.

About the Author

A man in a suit and tie stands and gestures while presenting in front of a projected screen displaying text and bullet points inside a lecture hall or conference room.

Jerónimo Kanahuati

Jero is a Consultant at The Decision Lab with a passion for artificial intelligence and behavioral science. Prior to joining The Decision Lab he founded a startup in Mexico to develop apps for kids to encourage education, and developing web scraping bots. He also worked at Google as an account manager and technical specialist focused on ad placement across Google's products. Jero has a bachelor's degree in engineering and a postgraduate specialty degree in operations from Universidad Panamericana in Mexico City.

About us

We are the leading applied research & innovation consultancy

Our insights are leveraged by the most ambitious organizations

“

I was blown away with their application and translation of behavioral science into practice. They took a very complex ecosystem and created a series of interventions using an innovative mix of the latest research and creative client co-creation. I was so impressed at the final product they created, which was hugely comprehensive despite the large scope of the client being of the world's most far-reaching and best known consumer brands. I'm excited to see what we can create together in the future.

Heather McKee

BEHAVIORAL SCIENTIST

GLOBAL COFFEEHOUSE CHAIN PROJECT

OUR CLIENT SUCCESS

$0M

Annual Revenue Increase

By launching a behavioral science practice at the core of the organization, we helped one of the largest insurers in North America realize $30M increase in annual revenue.

Increase in Monthly Users

By redesigning North America's first national digital platform for mental health, we achieved a 52% lift in monthly users and an 83% improvement on clinical assessment.

Reduction In Design Time

By designing a new process and getting buy-in from the C-Suite team, we helped one of the largest smartphone manufacturers in the world reduce software design time by 75%.

Reduction in Client Drop-Off

By implementing targeted nudges based on proactive interventions, we reduced drop-off rates for 450,000 clients belonging to USA's oldest debt consolidation organizations by 46%

Let's chat

Consulting

Industries

Resources