AI prompts in Power Platform

How AI Builder and Copilot Studio promote prompts as a first-class artefact — prompt design, parameters, grounding, and the operational patterns that make AI prompts reliable in business apps.

Updated 2026-06-21

Across the Power Platform, generative AI is exposed through AI prompts — reusable, parameterised templates that call an LLM with consistent instructions and structured inputs. Treating prompts as artefacts (versioned, ALM-deployed, tested) rather than ad-hoc strings is the difference between a flashy demo and a production-grade AI feature.

Where prompts live. Prompts can be authored in:

  • AI Builder — the canonical home; "AI prompts" is a model type alongside form processors, classifiers, etc.
  • Copilot Studio — embedded in topic logic and agents.
  • Power Automate — invoked from flows.
  • Power Apps — invoked from canvas apps directly.

The prompt itself is a Dataverse-stored asset, solutionable, exportable.

Prompt anatomy.

  • Instruction text — the prompt body, with placeholders.
  • Input parameters — typed values supplied at invocation.
  • Output type — text, table, structured JSON.
  • Model selection — GPT-4, GPT-4o, or other available models.
  • Grounding data — knowledge sources the prompt can reference.

Designing a prompt. A production prompt typically has structure like:

  1. Role and context — "You are an expert in [domain]. Your job is to [task]."
  2. Input data — labelled values: "Customer message: . Account history: ."
  3. Instructions — what to produce, format constraints.
  4. Examples — few-shot examples showing desired output.
  5. Constraints and edge cases — what not to do, how to handle empty inputs.

Most prompt failures trace to vague instructions or missing examples. Iterate with real inputs until the output is reliable.

Output types.

  • Text — free-form response. Easy to call, hard to parse downstream.
  • Structured (JSON) — defined schema; the model is constrained to produce JSON matching the schema. Far more reliable for downstream automation.

Use structured outputs whenever the downstream step needs specific fields. Free-form text is fine for end-user display; downstream automation should rely on structured.

Grounding. A prompt can be grounded in:

  • Specific Dataverse rows — pulled in at invocation.
  • Documents — files in SharePoint, OneDrive, or AI Builder document collections.
  • Knowledge sourcesCopilot Studio knowledge.
  • Search results — Microsoft Search or Bing.

Grounding reduces hallucinations and aligns output to authoritative data.

Invocation patterns.

  • From a flow — the "Create text with GPT using a prompt" action.
  • From a canvas appAIBuilder.PromptName.Predict({inputs}) Power Fx call.
  • From a low-code plug-in — Power Fx call within plug-in logic.
  • From a Copilot Studio agent — generative actions triggered by intent.

Versioning and ALM. Prompts are versioned in the maker portal. Solutions export the current version; consumers reference the prompt by ID. Promotion through dev → test → prod follows standard solution ALM. Be careful about referencing prompts by name vs by ID — name changes break references.

Testing. The prompt builder has a test pane:

  • Provide sample inputs.
  • See the model's output.
  • Iterate.

Production prompts need more than UI testing — a small test harness in Power Automate or a Python script can regression-test prompts against a corpus of representative inputs.

Cost. Each prompt invocation costs AI credits per AI Builder licensing. Credits convert roughly to token usage. High-volume scenarios need careful cost modelling: a prompt firing on every Dataverse Create event at 10K events/day adds up.

Latency. LLM responses take seconds, not milliseconds. UX-blocking calls feel slow. Patterns:

  • Async invocation — kick off the prompt, show "processing", display result when ready.
  • Streaming output — display tokens as they arrive (where supported).
  • Caching — same inputs → same output; cache aggressively.

Common pitfalls.

  • Untested edge cases — empty inputs, very long inputs, unusual character sets break the prompt.
  • Trust-on-first-use — outputs assumed correct without verification; hallucinations slip into production.
  • No versioning hygiene — prompts edited live in prod; no rollback path.
  • Free-form text downstream — parsing brittle; structured output is the fix.
  • Cost overruns — high-volume invocation without monitoring; surprise bill.
  • Prompt injection vulnerability — user input flowing into prompts without sanitisation; users craft inputs to override instructions.

Prompt injection mitigation.

  • Treat user input as data, not instruction. Wrap user input in clear delimiters; instruct the model to ignore instructions in user input.
  • Validate output before acting — even if the model is "told" not to produce certain outputs, treat outputs as untrusted.
  • Limit privileges — a prompt that triggers an action shouldn't have full system permissions.

Operational guidance. Treat prompts like code: version, test, deploy through pipelines, monitor in production. The "type into a box and run it" feel of AI Builder makes it easy to skip these practices — and easy to ship fragile AI features that break under real usage. Discipline pays back.

Related guides