Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ISO/IEC 25066:2014 defines the Common Industry Format (CIF) for usability evaluation reports as part of the SQuaRE (Systems and software Quality Requirements and Evaluation) framework. It provides a standardized structure for documenting usability test results, making it possible to compare findings across studies, products, and organizations. For UX engineers and quality assurance teams, adopting the CIF means fewer ambiguities in handoffs and more reproducible evaluation workflows.
The CIF mandates a report structure with seven major sections: (1) executive summary, (2) product description, (3) evaluation context, (4) evaluation methodology, (5) data analysis and results, (6) findings and recommendations, and (7) appendices. Each section serves a distinct role and collectively ensures that a third party can reproduce the evaluation without referring to external documents.
From a design engineering perspective, the most critical section is “evaluation methodology.” It must specify the participant profile (sample size, inclusion criteria, domain experience), the tasks selected for testing, the test environment (laboratory, remote, field), and the metrics collected. Without this detail, the numerical results are essentially meaningless — a 90% task-completion rate in a lab with expert users is not comparable to the same number collected from novice users in the field.
| Section | Required Elements | Engineering Value |
|---|---|---|
| Executive Summary | Goals, key findings, severity ratings | Quick stakeholder alignment |
| Product Description | Target user profile, key functions, hardware/software context | Establishes scope boundaries |
| Evaluation Context | Use scenarios, environmental conditions, constraints | Enables reproducibility |
| Methodology | Participant criteria, task list, metrics, data collection tools | Core of scientific validity |
| Results | Effectiveness, efficiency, satisfaction data | Quantitative evidence base |
| Findings & Recommendations | Root cause analysis, prioritized fixes | Actionable engineering output |
| Appendices | Raw data, consent forms, task scripts | Audit trail |
The CIF organizes usability measurement around three pillars defined in ISO 9241-11: effectiveness, efficiency, and satisfaction. Effectiveness is captured through task completion rates, error counts, and help-system invocations. Efficiency is typically measured as time-on-task or clicks-per-task. Satisfaction is gathered via standardized questionnaires such as SUS (System Usability Scale), QUIS, or custom Likert scales.
An engineering insight worth emphasizing: the CIF does not prescribe a single metric for any pillar. Instead, it encourages evaluators to select metrics that are most sensitive to the product’s risk profile. For a medical device UI, error severity may outweigh time-on-task; for an e-commerce checkout flow, task-completion rate is king. Defining these priorities before data collection avoids post-hoc cherry-picking.
| Use Context | Primary Metric | Secondary Metric | Minimum Sample |
|---|---|---|---|
| Medical device UI | Error severity (count × harm level) | Task completion | 15–20 per segment |
| Consumer mobile app | Time-on-task | SUS score | 12 per segment |
| Enterprise dashboard | Task completion rate | Clicks-per-task | 8–10 per segment |
| Safety-critical HMI | Response time deviation | Error rate | 20+ per segment |
Integrating the CIF into a CI/CD pipeline is an emerging practice. By instrumenting formative usability tests with automated logging (clickstreams, session recording, task-timing middleware), teams can generate CIF-compliant report drafts programmatically. The structured nature of the CIF makes it an excellent target for template-driven report generators.
Another key insight: the CIF’s recommendation section is the primary vehicle for driving design changes. Each recommendation should be tagged with a severity level (critical, major, minor) and linked to specific raw data points. This transforms the evaluation report from a mere record into a traceable requirements document that product owners and developers can act upon.