Basics of Prompt Engineering for LLMs
Part 1: Introduction to Gen-AI
Part 2: Techniques in Prompt Engineering
Part 3: Exploring ChatGPT with Data Analytics
Closing
I have been a University Teacher in Statistics (Teaching Fellow, FHEA) at the School of Mathematics (SoM), University of Edinburgh, since 2022.
Parallelization and Efficiency
Long-Range Dependencies
Transfer Learning and Contextual Embeddings
Attention Overhead for Long Sequences
Lack of Sequential Order
Excessive Parameterization
Fixed Input Length and Inability to Handle Unstructured Inputs
Indeed a decoder-only type of LLM, but the output improvement relies on the concept of RLHF (Reinforcement Learning from Human Feedback)
Multimodality is already here for some models!
Available as apps (e.g., ChatGPT for Android)
Specifically: “ChatGPT can now see, hear, and speak.”
Custom GPTs have already been released! More recently, OpenAI launched the GPT Store
Upcoming: “Gemini is built from the ground up for multimodality — reasoning seamlessly across text, images, video, audio, and code.”
Different versions of GPT (including GPT-4o, GPT-4o mini, and o1-preview for advanced reasoning), most recently GPT-4o with canvas
The voice option now works for GPT, and ChatGPT Search was introduced more recently
Claude 3.5 Sonnet and Claude 3.5 Haiku: you can direct Claude to use computers the way people do
Meta’s Next Llama AI Models Are Training on a GPU Cluster ‘Bigger Than Anything’ Else?
An increase in multi-agent frameworks for different applications!
What about available tools that we can play with?
OpenAI: GPT-3.5, or the more recent GPT-4 if you have a paid membership
Google: Bard now, Gemini later
Anthropic: Claude 2, via its website
Meta: Llama-2 (70B) via a specific website
Mistral: check out their platform as an open-source alternative
Microsoft Copilot is available for the university! It was announced as Protected Mode for users from educational institutions
University platform: Edinburgh (access to) Language Models, via ELM
Prompt engineering is mainly about writing effective prompts for language models (LMs), and it can be considered an iterative process.
It allows us to better understand the capabilities and limitations of large language models (LLMs)
Researchers generally use prompt engineering to improve the capacity of the considered LLMs on a wide range of tasks, including (but not limited to) question answering and arithmetic reasoning.
When designing and writing prompts, we typically interact with the LLM via an API or a more friendly interface such as ChatGPT.
It is crucial to know the configurable model parameters and how they work in general!
What about these main parameters?
Temperature: Controls the randomness of sampling. Lower values give more focused, deterministic outputs; higher values give more diverse, creative ones.
Top P: Nucleus sampling; the model only considers tokens within the top cumulative probability mass p.
Max Length: The maximum number of tokens the model can generate, useful for controlling response length and cost.
Stop Sequences: Strings that stop the model from generating further tokens, giving control over where a response ends.
Frequency Penalty: Penalizes repeated tokens in proportion to how often they have already appeared. Reduces word repetition in responses.
Presence Penalty: Penalizes all repeated tokens equally. Prevents phrase repetition. Adjust for creativity or focus.
Note
The general recommendation is to alter temperature or Top P but not both.
Note
Similar to temperature and Top P, the general recommendation is to alter the frequency or presence penalty, but not both.
Warning
Keep in mind that your results may vary depending on which LLM, or which version of an LLM, you use.
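As a minimal sketch, here is how these parameters might be set through the OpenAI Python API; the model name and all values below are illustrative, not recommendations:

```python
# Minimal sketch: setting the main parameters via the OpenAI Python API.
# Model name and parameter values are illustrative, not recommendations.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Summarise prompt engineering in one sentence."}],
    temperature=0.7,        # randomness of sampling (alter this OR top_p, not both)
    top_p=1.0,              # nucleus sampling threshold
    max_tokens=150,         # Max Length: cap on generated tokens
    stop=["\n\n"],          # Stop Sequences: generation halts here
    frequency_penalty=0.0,  # penalize tokens by how often they have appeared
    presence_penalty=0.0,   # penalize any repeated token equally
)
print(response.choices[0].message.content)
```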
Prompt: What we want from the LLM
Output: What we will get from the LLM
Note
A prompt can contain information like the instruction or question you are passing to the model and include other details such as context, inputs, or examples
Tip
It is possible to use such elements when necessary to guide the LLM more effectively and improve the output!
Instruction: a specific task or instruction you want the model to perform
Context: external information or additional context that can steer the model to better responses
Input Data: the input or question we want a response to
Output Indicator: the type or format of the output.
Warning
Not all four elements are needed in a prompt, and the format depends on the task.
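As an illustrative sketch, here is a prompt built from all four elements; the classification task below is made up for demonstration:

```python
# Hypothetical prompt combining the four elements; the task is made up.
instruction = "Classify the review into neutral, negative, or positive."  # Instruction
context = "The reviews come from a university course feedback form."      # Context
input_data = "Review: The lectures were engaging and well structured."    # Input Data
output_indicator = "Sentiment (one word):"                                 # Output Indicator

prompt = "\n".join([instruction, context, input_data, output_indicator])
print(prompt)
```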
Examples demonstrating the application of ELM across different themes:
Write clear instructions
Provide reference text, if needed
Split complex tasks into simpler sub-tasks
Give the model time to “think”
Use external tools or knowledge, if needed
Test changes systematically
Note
For each strategy, we can see various tactics below.
This is like feeding the LLM important details or context
Ask the model to adopt a persona
Use delimiters to clearly indicate distinct parts of the input
Specify the steps required to complete a task
Use embeddings-based search to implement efficient knowledge retrieval
More technical, e.g., the RAG method
Use code execution to perform more accurate calculations or call external APIs
Such as guiding the LLM to do something with another tool!
Evaluation procedures (or “evals”) are useful for optimizing system designs
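As a small sketch combining two of these tactics, adopting a persona and using delimiters, via the OpenAI Python API (model name and messages are illustrative):

```python
# Sketch of two tactics: a persona in the system message, and delimiters
# (triple quotes) marking the distinct part of the input. Names illustrative.
from openai import OpenAI

client = OpenAI()
text = "..."  # the text to summarise goes here

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        # Tactic: ask the model to adopt a persona.
        {"role": "system",
         "content": "You are a statistics lecturer explaining ideas to first-year students."},
        # Tactic: delimiters clearly indicate the distinct part of the input.
        {"role": "user",
         "content": f'Summarise the text delimited by triple quotes in two sentences.\n"""{text}"""'},
    ],
)
print(response.choices[0].message.content)
```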
Note
More advanced prompt engineering techniques that allow us to achieve more complex tasks and improve the reliability and performance of LLMs
Note
Remember that, in general, LLM outputs may include hallucinations!
These are some techniques from the open book of the Prompt Engineering Guide, including but not limited to:
Zero-Shot Prompting
Few-Shot Prompting
Chain-of-Thought Prompting
Generated Knowledge Prompting
PAL (Program-Aided Language Models)
Generally, LLMs are capable of performing some tasks “zero-shot”: we simply ask for what we want, without providing any examples
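For instance, a minimal zero-shot prompt might look like this (the sentiment task is illustrative, in the spirit of the Prompt Engineering Guide):

```python
# Zero-shot: no demonstrations, just the instruction (task is illustrative).
prompt = """Classify the text into neutral, negative, or positive.
Text: I think the vacation was okay.
Sentiment:"""
```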
When zero-shot doesn’t work, it’s recommended to provide demonstrations or examples in the prompt, which leads to few-shot prompting
This technique enables in-context learning where we provide demonstrations in the prompt to improve the model performance.
For more difficult tasks, we can experiment with increasing the demonstrations (e.g., 3-shot, 5-shot, 10-shot, etc.)
Warning
It is better than zero-shot to some extent, but it may still fail, especially on more complex reasoning tasks!
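A minimal 3-shot sketch, where the demonstrations precede the new input (texts and labels are made up for illustration):

```python
# Few-shot (3-shot): demonstrations in the prompt enable in-context learning.
prompt = """Text: The lecture was fantastic. // Sentiment: positive
Text: The room was far too cold. // Sentiment: negative
Text: The session started at 9am. // Sentiment: neutral
Text: I really enjoyed the workshop. // Sentiment:"""
```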
This method has been popularized to address more complex arithmetic, commonsense, and symbolic reasoning tasks.
Chain-of-thought (CoT) prompting enables complex reasoning capabilities through intermediate reasoning steps (Wei et al., 2022)
It is possible to combine it with few-shot prompting to get better results on more complex tasks
You can see different types, such as Zero-shot or Few-shot CoT Prompting, or Automatic Chain-of-Thought (Auto-CoT)
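As a sketch, zero-shot CoT simply appends a reasoning trigger to the question; the arithmetic task below is made up for illustration:

```python
# Zero-shot CoT: append "Let's think step by step" to elicit the
# intermediate reasoning steps before the final answer.
prompt = """I bought 10 apples, gave 2 to the neighbour and 2 to a friend,
then bought 5 more and ate 1. How many apples do I have left?
Let's think step by step."""
```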
For more complex and knowledge-intensive tasks, it’s possible to build a language model-based system that accesses external knowledge sources
This is a common approach and one of the cheapest options to reduce the problem of “hallucination” in general
RAG combines an information retrieval component with a text generator model.
RAG allows language models to bypass retraining, enabling access to the latest information for generating reliable outputs via retrieval-based generation
Image Credit: Langchain documentation
There are some code-based examples (if you are interested):
Short notes and examples in Python
Short demo of Embedchain for a webpage
Short demo of Langchain for a pdf file
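In addition, here is a minimal, framework-free sketch of the retrieve-then-generate pattern; the documents, model names, and similarity computation are illustrative only:

```python
# Minimal RAG sketch: retrieve the most relevant document, then generate an
# answer grounded in it. Documents and model names are illustrative only.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

documents = [
    "ELM gives University of Edinburgh users access to language models.",
    "Prompt engineering is an iterative process of writing effective prompts.",
]

def embed(text):
    # Retrieval component: represent text as an embedding vector.
    return client.embeddings.create(model="text-embedding-3-small", input=text).data[0].embedding

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / ((sum(x * x for x in a) ** 0.5) * (sum(y * y for y in b) ** 0.5))

question = "What is ELM?"
q_emb = embed(question)
context = max(documents, key=lambda d: cosine(q_emb, embed(d)))  # retrieve best match

# Generator component: the LLM answers using only the retrieved context.
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user",
               "content": f"Answer using only this context:\n{context}\n\nQuestion: {question}"}],
)
print(response.choices[0].message.content)
```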
Note
A brief introduction to ChatGPT Advanced Data Analytics and to GPTs!
“Discover and create custom versions of ChatGPT that combine instructions, extra knowledge, and any combination of skills.”
Other LLM-related tools include:
Elicit - Analyze research papers
Perplexity AI - Conversational search engine
…
promptingguide.ai: a prompt engineering guide that demonstrates many techniques. The hub page in particular is good to explore
Brex’s Prompt Engineering Guide: Brex’s introduction to language models and prompt engineering.
Please share your feedback to improve this training