Psychological Flexibility as a KPI: Why Your Clinic Should Track What Actually Predicts Outcomes

Psychological flexibility accounts for a striking proportion of the variance in mental health and wellbeing outcomes, yet most clinics still define success by how many sessions a client attends. When researchers like Kashdan and Rottenberg describe psychological flexibility as a "fundamental aspect of health," and meta-analytic data from over 12,000 participants confirms it predicts wellbeing across conditions, the question is not whether we should track it. The question is why we are not already doing so.

The metric most clinics ignore, and why it matters more than session count

Walk into most clinic management meetings and you will hear the same numbers: session volume, rebooking rates, client satisfaction scores, maybe revenue per practitioner. These are operational metrics. They tell you how the business is running. They tell you almost nothing about whether your clients are actually getting better.

This is not a trivial gap. Hayes et al. positioned psychological flexibility as the central process of change in Acceptance and Commitment Therapy, and subsequent research has expanded this claim well beyond ACT. A growing body of evidence suggests that flexibility, the capacity to contact the present moment, hold difficult thoughts and feelings without being dominated by them, and move toward valued action, is one of the strongest transdiagnostic predictors of therapeutic change we have.

Yet the default KPIs in most practices are proxies at best. High session attendance could mean a client is engaged, or it could mean they are stuck. A glowing satisfaction survey might reflect genuine progress, or a pleasant therapeutic relationship that is not producing meaningful change. We have been measuring the container and ignoring the contents.

The clinics that will lead the next decade of integrative care are the ones that start measuring what the evidence says actually matters. That means treating psychological flexibility not as an abstract therapeutic concept, but as a trackable, operational KPI. As explored in our practical guide to measurement-based care in 2026, the tools and frameworks to do this are already within reach.

What psychological flexibility actually is, and is not

Before we can measure something meaningfully, we need to define it precisely. Psychological flexibility is often described in shorthand as "the ability to be present, open up, and do what matters." That is a useful starting point, but it can veer into self-help platitude if we are not careful.

The six core processes

In the ACT model, psychological flexibility is not a single trait. It is the dynamic interaction of six core processes, often visualised as the "hexaflex": acceptance, cognitive defusion, present-moment awareness, self-as-context, values, and committed action. Each process represents a skill that can be developed, not a fixed personality characteristic.

Acceptance involves willingness to experience difficult emotions without avoidance. Cognitive defusion is the ability to notice thoughts as thoughts, rather than literal truths that demand action. Present-moment awareness is flexible attention to here-and-now experience. Self-as-context refers to a stable sense of self from which experiences can be observed. Values clarify what matters. Committed action is behaviour guided by those values, even in the presence of discomfort.

What flexibility is not

It is worth naming several common conflations. Psychological flexibility is not resilience, though the two are related. Resilience implies bouncing back; flexibility implies a broader repertoire of responses to changing circumstances. It is not positive thinking. A psychologically flexible person does not replace negative thoughts with positive ones; they change their relationship to all thoughts. And it is emphatically not emotional suppression. In fact, experiential avoidance, the opposite of acceptance, is one of the strongest predictors of psychopathology (Hayes et al., 2006).

How we measure it

Two validated instruments dominate the literature. The Acceptance and Action Questionnaire-II (AAQ-II; Bond et al., 2011) is a seven-item self-report measure that has been used in hundreds of studies. It is brief, well-validated, and freely available. The Comprehensive Assessment of ACT Processes (CompACT; Francis et al., 2016) offers a more nuanced picture, capturing three subscales: openness to experience, behavioural awareness, and valued action. It is longer (23 items) but gives clinicians richer data.

Neither instrument is perfect. Self-report measures carry inherent limitations: social desirability bias, varying levels of self-awareness, and cultural framing effects. We will return to these limitations later. But as clinical tools, both the AAQ-II and CompACT offer something rare: a direct, quantifiable window into the process of change, not just its downstream symptoms.

The evidence: why flexibility predicts outcomes across conditions

The empirical case for psychological flexibility as a transdiagnostic outcome is substantial, though we should be precise about what the evidence does and does not support.

The headline numbers

conducted a comprehensive meta-analysis of 174 samples totalling 12,477 participants. They found that psychological flexibility was significantly associated with a wide range of wellbeing and mental health outcomes. The relationship held across clinical and non-clinical populations, across different measurement tools, and across conditions including depression, anxiety, chronic pain, and workplace stress. Effect sizes were generally medium to large.

Kashdan and Rottenberg made an even broader claim in their influential review, arguing that psychological flexibility should be considered a fundamental aspect of health, on par with physical biomarkers. Their synthesis drew from clinical, social, personality, and developmental psychology literatures.

Digital delivery shows promise

For clinicians considering how to integrate flexibility tracking with digital tools, the evidence on technology-delivered ACT is encouraging. reviewed early digital ACT interventions and found generally positive effects on psychological flexibility and related outcomes. More recent work has extended these findings, as we detail in our examination of the evidence for delivering ACT through technology.

This matters for a practical reason: if your clinic is going to track flexibility as a KPI, the most data-rich approach involves capturing it between sessions, in clients' daily lives, not just in the waiting room. Digital delivery and digital measurement are closely linked.

Where evidence is strong versus emerging

Honesty about evidence strength matters. The link between psychological flexibility and outcomes in anxiety and depression is well-established, supported by multiple meta-analyses and randomised controlled trials. The evidence in chronic pain is strong and growing, with flexibility-based interventions showing meaningful effects on pain interference and quality of life even when pain intensity does not change.

In areas like workplace wellbeing, substance use, and psycho-oncology, the evidence is promising but still emerging. Effect sizes tend to be smaller, and many studies have methodological limitations including small samples, lack of active control conditions, and reliance on the AAQ-II as both a process and outcome measure. This circularity is a legitimate concern that the field is actively addressing.

The honest summary: psychological flexibility is one of the best-supported transdiagnostic processes we have. It is not the only one, and the evidence is not uniformly strong across all populations and conditions. But if you are going to track a single process variable, it has the broadest empirical foundation.

From research construct to clinical KPI: making the shift practical

Knowing that psychological flexibility predicts outcomes is one thing. Operationalising it in a clinic with 12 practitioners, 400 active clients, and limited admin capacity is another. Here is how to make it practical.

Choosing your instrument

The AAQ-II is the obvious starting point for most practices. Seven items, roughly two minutes to complete, free to use, and backed by extensive normative data. Its brevity makes it feasible for repeated measurement. The downside is that it is a blunt instrument; it captures general psychological inflexibility but does not differentiate which specific processes are stuck.

The CompACT is the stronger choice if your practice is ACT-oriented and you want actionable clinical data. Its three subscales (openness to experience, behavioural awareness, valued action) map onto the hexaflex processes and can directly inform session planning. The trade-off is length and slightly more complex scoring.

A third option is developing bespoke items tailored to your client population, but this sacrifices the validated normative benchmarks that make standardised instruments useful for tracking and comparison. For most clinics, starting with a validated measure and adding bespoke items later is the pragmatic path.

How often to measure

Annual or even quarterly measurement is too sparse to capture meaningful change trajectories. Research on measurement-based care suggests that routine outcome monitoring is most useful when administered at every session or at minimum every two to four weeks. For a process measure like psychological flexibility, fortnightly administration strikes a reasonable balance between data richness and client burden.

The concern that clients will find repeated questionnaires burdensome deserves a direct response. The evidence suggests the opposite: clients in measurement-based care report feeling more heard, not less. A two-minute questionnaire that a practitioner actually reviews and discusses is experienced as care, not bureaucracy. The burden is usually on the practitioner who does not know what to do with the data, not on the client who fills in the form.

Interpreting scores over time

Individual scores at a single time point are far less useful than trajectories. A client with an AAQ-II score of 24 (moderate inflexibility) who has moved from 32 over eight weeks is telling a very different story from a client who has been stable at 24 for months. Plotting scores over time, even on a simple spreadsheet, transforms a static number into a clinical narrative.

Setting benchmarks requires knowing your population. Normative data for the AAQ-II suggest a mean of approximately 18 to 24 in non-clinical samples and 28 to 35 in clinical samples (Bond et al., 2011). But your clinic's baseline will depend on your client mix. Three months of baseline data collection will give you far more useful benchmarks than any published norm.

Between-session tracking changes the data entirely

Here is where the conversation shifts. Pre and post session measurement, however valuable, captures flexibility at a single moment: the therapy room. But flexibility is not a therapy-room phenomenon. It is what happens on a Tuesday afternoon when a client's manager criticises their work, or on a Sunday evening when anticipatory anxiety about the week builds.

Ecological momentary assessment (EMA) approaches capture psychological states in real time, in real contexts. Rather than asking a client to retrospectively summarise their week, EMA prompts brief check-ins throughout daily life. This addresses the well-documented limitations of retrospective self-report, including recall bias and peak-end effects.

Just-in-time measurement and intervention

Nahum-Shani et al. describe a framework for just-in-time adaptive interventions (JITAIs) that use momentary data to deliver the right support at the right moment. In the context of psychological flexibility, this might mean detecting a period of high experiential avoidance via brief EMA data and offering a targeted defusion exercise in response.

This is not science fiction. The technology exists today. What has been missing is the clinical infrastructure to connect momentary data back to practitioner decision-making. A notification that a client's flexibility scores have dropped meaningfully between sessions is qualitatively different from discovering the same information two weeks later in a session.

Between-session engagement is already recognised as a critical gap in clinical outcomes. As we have explored in depth elsewhere, the between-session gap is where much of the real work of therapy either happens or does not. Adding flexibility measurement to that between-session window does not just improve data quality; it changes what practitioners can see and when they can see it.

Micro-measurement in practice

What does this look like concretely? A client receives a brief daily or every-other-day prompt on their phone: three to five items drawn from validated flexibility measures or tailored to their therapeutic goals. "How willing were you to experience difficult feelings today?" "Did you take action on something that matters to you, even though it felt uncomfortable?" Responses take under a minute.

Over weeks, these micro-measurements produce a data stream that captures the texture of a client's psychological life in ways that a fortnightly questionnaire cannot. Practitioners gain visibility into patterns: flexibility dips on work days, increases after social connection, drops before family visits. This is not replacing clinical judgement. It is feeding it with richer information.

What the data unlocks: clinical decision-making, not just reporting

Tracking psychological flexibility only matters if it changes what clinicians actually do. If flexibility scores go into a database and are never reviewed, you have created admin burden with no clinical return. The goal is not measurement for measurement's sake; it is measurement that informs decisions.

Session planning and course correction

Consider a hypothetical: a small integrative practice with six practitioners tracking CompACT scores fortnightly. One practitioner notices that a client's "openness to experience" subscale has plateaued while "valued action" continues to improve. This signals that the client is behaviourally activated but may be pushing through avoidance rather than genuinely accepting difficult emotions. The practitioner shifts the next session toward experiential acceptance work rather than continuing with values-based behavioural experiments.

Without the data, this pattern might take weeks longer to identify, if it were identified at all. With it, the practitioner makes a targeted adjustment in real time.

Flagging dropout risk

Research consistently shows that clients who are not improving, or who are deteriorating, are more likely to drop out. A sudden decline in flexibility scores between sessions can serve as an early warning system. If a clinic's digital infrastructure surfaces this decline to the practitioner before the next session, there is an opportunity to reach out proactively: a check-in message, an adjusted between-session exercise, or a rescheduled session.

This is where tracking a process variable like flexibility has an advantage over tracking symptom measures alone. Symptoms can fluctuate for many reasons. A decline in flexibility often signals a shift in how a client is relating to their experience, which is both more specific and more actionable.

Supervision and team learning

Aggregate flexibility data across a caseload or across a practice enables a different kind of supervision conversation. Instead of "How is your client doing?" (which relies on practitioner recall and narrative), the conversation becomes "Your caseload's average flexibility scores have improved 12% this quarter, but clients in the chronic pain cohort are plateauing. What are you noticing clinically?" Data-enriched supervision is more specific, more productive, and less reliant on individual memory.

Referral credibility

If your practice accepts referrals from GPs, insurers, or corporate clients, outcome data is increasingly expected. But session counts and satisfaction scores are thin evidence. Being able to demonstrate measurable improvement in a validated, transdiagnostic process measure positions your practice as evidence-informed in ways that differentiate you from competitors who can only report attendance figures.

The integration of blended care models makes this even more powerful. When you can show that your combination of in-session work and between-session digital engagement produces measurable flexibility gains, you are offering referral partners something concrete.

Pitfalls and honest limitations

It would be intellectually dishonest to advocate for flexibility as a KPI without naming the real risks and limitations. Several deserve serious attention.

Goodhart's law

The economist Charles Goodhart observed that when a measure becomes a target, it ceases to be a good measure. If practitioners are evaluated on their clients' flexibility scores, the incentive to teach clients how to "score well" on the AAQ-II rather than genuinely developing flexibility is real. This is not hypothetical. It has happened with symptom measures in systems where outcomes are tied to funding.

The mitigation is cultural, not technical. Flexibility scores should inform clinical conversations, not determine practitioner performance reviews. They are a compass, not a scorecard.

Cultural validity concerns

The AAQ-II and CompACT were developed primarily in Western, English-speaking populations. While translations exist, the conceptual framework of psychological flexibility, particularly around values and acceptance, may carry different meanings across cultures. Acceptance, for example, may be experienced as passive resignation in cultural contexts where active problem-solving is valued, or it may be so deeply embedded in a cultural worldview that measuring it as a distinct construct makes little sense.

Research on the cross-cultural validity of flexibility measures is growing but still insufficient. Clinics serving diverse populations should treat scores as conversation starters, not definitive verdicts, and should remain alert to cultural framings that existing instruments may miss.

Reductionism risk

Psychological flexibility is a rich, multidimensional, contextually embedded construct. Reducing it to a single number, or even three subscale scores, necessarily loses information. There is a real tension between the clinical utility of a simple metric and the nuanced reality of human experience. A CompACT score of 80 tells you something. It does not tell you everything.

The best use of flexibility data is as a complement to clinical observation, not a replacement for it. Numbers open questions. They do not close them.

Practitioner autonomy versus protocol

Some practitioners will resist data-driven approaches on principle, viewing them as an encroachment on clinical autonomy. This concern is worth respecting rather than dismissing. The goal is not to create algorithmic therapy where a score triggers a prescribed response. The goal is to give practitioners better information so their clinical judgement has more to work with. If the data infrastructure is designed to support practitioners rather than surveil them, resistance tends to diminish.

Building flexibility tracking into your practice workflow

If you are persuaded that tracking psychological flexibility is worth trying, the worst thing you can do is attempt a whole-practice rollout on day one. Here is a more realistic approach.

Step one: start small

Choose one instrument. For most practices, the AAQ-II is the right starting point: it is brief, validated, and your team can learn to interpret it quickly. Select one cohort of clients, ideally a group where ACT processes are already central to the clinical approach. Run for three months before evaluating.

Step two: integrate into existing touchpoints

Embed the measure into your intake process as a baseline. Add it to regular review sessions (every two to four weeks). If you already use a pre-session check-in process, add the seven AAQ-II items to it. The less disruption to existing workflow, the higher the adoption.

Step three: make the data visible

Scores that live in a filing cabinet or a forgotten spreadsheet column do not change clinical behaviour. Practitioners need to see flexibility trajectories at the point of care: before a session, during supervision, during case reviews. This is where digital infrastructure matters. Platforms designed for measurement-based care can automate collection, calculate trajectories, and surface trends without requiring manual data entry. Afterglow, for instance, integrates between-session engagement tools with outcome tracking so that flexibility data flows naturally alongside the client's daily practice, rather than existing as a separate administrative task.

Step four: close the feedback loop

Share flexibility data with clients. This is not about turning therapy into a data dashboard. It is about collaborative transparency. "I noticed your willingness to experience difficult emotions has been increasing steadily over the last month, even though you have been reporting more anxiety. What do you make of that?" This kind of conversation is only possible when the data exists.

Step five: review and iterate

After three months, assess: Is the data informing clinical decisions? Are practitioners finding it useful or burdensome? Are clients engaging with repeated measurement? Adjust frequency, instrument choice, or cohort based on what you learn. Consider expanding to the CompACT for practitioners who want more granular process data. Consider adding between-session micro-measurement for clients who are digitally engaged.

Where this is heading: flexibility as the common currency of integrative care

Step back from the practical details for a moment and consider a bigger picture. Integrative clinics increasingly blend modalities: ACT alongside IFS, somatic approaches, mindfulness-based interventions, elements of CBT. This is clinically appropriate for many clients. But it creates a measurement challenge. How do you track outcomes when different modalities use different frameworks and different language?

Psychological flexibility may offer an answer. It is not owned by ACT, even though ACT gave it its most rigorous articulation. The core processes of flexibility, being present, open, and engaged in valued action, overlap substantially with goals across modalities. IFS aims to help clients access Self-energy and unblend from protective parts, which maps closely onto self-as-context and defusion. Somatic approaches often target the body-based avoidance patterns that acceptance work addresses. Even CBT's cognitive restructuring, when done well, produces a form of defusion.

This makes psychological flexibility a plausible "common currency" for integrative practices. A single, validated, transdiagnostic process measure that all practitioners can track regardless of their primary modality. The potential for cross-clinic benchmarking, anonymised and aggregated, is significant. Imagine a network of integrative practices able to compare flexibility trajectories across different blended care models, identifying which combinations of modalities and between-session tools produce the most meaningful change for different client profiles.

We are not there yet. But the infrastructure is being built. The practices that begin tracking flexibility now will have years of baseline data by the time the field catches up. They will be the ones who can demonstrate their value with evidence, not anecdotes.

The deeper shift is this: the behavioural health field is moving, slowly but unmistakably, from valuing inputs (sessions delivered, hours billed) to valuing outcomes (meaningful change in clients' lives). When that shift reaches its tipping point, the clinics that have been quietly measuring what actually matters will not need to scramble. They will already be there.

The question is not whether your clinic will eventually track therapeutic process. It is whether you will be leading that conversation or catching up to it.

References

Bond, F. W., Hayes, S. C., Baer, R. A., Carpenter, K. M., Guenole, N., Orcutt, H. K., Waltz, T., & Zettle, R. D. . Preliminary psychometric properties of the Acceptance and Action Questionnaire-II: A revised measure of psychological inflexibility and experiential avoidance. Behavior Therapy, 42(4), 676-688.
Francis, A. W., Dawson, D. L., & Golijani-Moghaddam, N. . The development and validation of the Comprehensive assessment of Acceptance and Commitment Therapy processes (CompACT). Journal of Contextual Behavioral Science, 5(3), 134-145.
Hayes, S. C., Luoma, J. B., Bond, F. W., Masuda, A., & Lillis, J. . Acceptance and commitment therapy: Model, processes and outcomes. Behaviour Research and Therapy, 44(1), 1-25.
Kashdan, T. B., & Rottenberg, J. . Psychological flexibility as a fundamental aspect of health. Clinical Psychology Review, 30(7), 865-878.
Nahum-Shani, I., Smith, S. N., Spring, B. J., Collins, L. M., Witkiewitz, K., Tewari, A., & Murphy, S. A. . Just-in-time adaptive interventions (JITAIs) in mobile health: Key components and design principles for ongoing health behavior support. Annals of Behavioral Medicine, 52(6), 446-462.