Cambridge OCR Physics Exam Errors and QA Workflows

An exam paper is not only a document.

It is an information product.

Behind every question there is source material, subject expertise, data, diagrams, tables, formulas, mark schemes, checking, sign-off, and correction routes. If those pieces do not line up, the final exam paper can look official while still being wrong.

That is why the Cambridge OCR physics exam errors matter beyond education.

Cambridge OCR was fined £270,000 by Ofqual after errors were identified in A-level and AS-level physics exam papers and mark schemes. Reporting said there were 12 errors between April and October 2025. Some were caught before students sat the exam. Others were not. The reported errors included wrong data in a table, spreadsheet data not matching graphs, and mistakes that contributed to some students receiving incorrect grades.

This is not an AI failure story. It is not a complex algorithm story. It is a workflow story.

The issue sits in a more ordinary but serious place: assessment content, spreadsheet checking, mark scheme quality assurance, review escalation, and the use of flawed material in high-stakes decisions.

If your team produces reports, dashboards, public submissions analysis, evidence tables, assessment material, donor outputs, or decision notes, the lesson is direct. A clean-looking output is not enough. The workflow behind it needs to be checked. If you need help reviewing the route from raw information to final output, start with a data collection and intake system, traceable evidence workflow support, or a data use, reporting and communication system.

Who this guide is for

This guide is for: Teams producing high-stakes reports, assessment material, dashboards, public submissions analysis, evidence tables, and decision notes.

Key takeaways

A clean-looking exam paper, report or evidence table can still carry unchecked source errors.
QA needs to test the chain from source data to graphs, mark schemes and final outputs.
Correction routes should be clear before people rely on high-stakes material.
Source traceability makes errors easier to detect, explain and correct.

The current situation

The Times reported that Cambridge OCR was fined £270,000 after Ofqual found unacceptable errors in A-level and AS-level physics papers and mark schemes.

The errors were identified between April and October 2025. Some were spotted before the exams and correction notices were issued. Some were identified before results were released, allowing action to be taken. Others went further through the system and contributed to incorrect grades being issued.

The reported mistakes included wrong data in a table and spreadsheet data not matching graphs.

That detail matters.

This was not only a wording mistake. It involved the kinds of source data, spreadsheet outputs, visual evidence, and marking guidance that students and examiners rely on during a high-stakes assessment process.

Ofqual also reportedly said Cambridge OCR had failed to ensure paper content was fit for purpose and had failed to have clear arrangements in place for schools to request mark adjustments because of errors.

So there are two layers to the failure.

The first layer is the quality of the exam material itself. The second layer is the correction and escalation route when errors are found.

Both are workflow problems.

What happened

The reported errors appeared across exam papers and mark schemes.

Some mistakes were caught before the exam and communicated to schools. That suggests there was some review and correction process in place.

But some errors were not caught before students sat the papers. Some were only identified later. Some affected the marking and grading process.

That is the important point.

A high-stakes information workflow cannot rely on one final check. It needs checks at multiple points:

when source data is selected

when tables are created

when graphs are produced

when questions are written

when mark schemes are drafted

when spreadsheet calculations are checked

when the paper is reviewed by subject experts

when corrections are issued

when marking adjustments are applied

when results are checked before release

If one of those points fails, the next point should catch the problem.

When several points fail, the error reaches the student.

The main point

The Cambridge OCR physics exam errors show that quality assurance is not a final proofreading step.

Quality assurance is the whole route from source material to final use.

In this case, the final product was an exam paper and mark scheme. In another organisation, the final product might be a report, dashboard, submission analysis, briefing note, AI summary, donor update, or public-facing document.

The same principle applies.

If the source data is wrong, the graph may be wrong. If the graph is wrong, the question may be wrong. If the question is wrong, the mark scheme may be wrong. If the mark scheme is wrong, the grade may be wrong. If the grade is wrong, the student may be affected.

The harm does not start at the final output. It starts earlier in the workflow.

That is why source traceability in evidence-heavy reports matters. It gives the team a way to work backwards from a claim, table, result, or recommendation to the material behind it.

Where the workflow failed

1. Source data and question data were not checked tightly enough

In an exam paper, a data table is not decoration. It is evidence that the student must use to answer the question.

If the table is wrong, the student is not only facing a typo. They are working from faulty information.

The same applies to reports, dashboards, consultation summaries, and donor outputs. A table may look clean, but that does not mean the underlying data is correct.

This is why a good workflow starts before the final document is written. The team needs a clear route for checking:

where the data came from

whether the data has been copied correctly

whether formulas have changed the result

whether tables match the source

whether graphs match the table

whether the final output still reflects the original evidence

For teams collecting information from forms, fieldwork tools, public submissions, partner reports, or spreadsheets, this is the purpose of a data collection and intake system. The goal is not only to capture information. The goal is to capture it in a way that can be checked and used later.

2. Spreadsheet outputs and graphs did not stay aligned

One of the reported issues was spreadsheet data not matching graphs.

That is a familiar workflow failure.

A graph may be built from an older data range. A spreadsheet may be updated after the chart is created. A formula may change. A row may be added. A cell may be overwritten. A chart may look right, but no longer match the source.

This is not only a spreadsheet problem. It is a version control and review problem.

Any workflow that uses tables, charts, dashboards, formulas, or evidence matrices needs a way to check whether the output still matches the source data.

That can include:

locked source tabs

separate raw and processed data tabs

named ranges

data dictionaries

formula checks

chart-source checks

review status fields

change logs

final-output QA checks

If the spreadsheet is feeding a report, exam, dashboard, or decision note, the spreadsheet is part of the evidence trail.

This is why a source-linked evidence table is useful in evidence-heavy work. It does not only collect material. It keeps the route between source, table, claim, and output visible.

3. The mark scheme depended on the same weak workflow

An exam question and its mark scheme are linked. If the question contains faulty data, the mark scheme may need to change. If the mark scheme does not reflect the error, examiners may mark students unfairly.

That is the second-order problem.

A mistake in the source material can move into the marking process. Once that happens, the error is no longer only in the paper. It is in the system that turns student answers into marks.

In reporting work, the equivalent is a weak finding being copied into a recommendation matrix. Or a summary being used in a board pack. Or an AI-generated answer being used in a client report without checking the source.

This is why traceable evidence workflow support focuses on keeping sources, claims, quotes, themes, findings, recommendations, and review notes connected. The danger is not only that one line is wrong. The danger is that the wrong line becomes part of the decision process.

4. Corrections did not catch everything early enough

Some errors were reportedly identified before students sat the exam. Schools received correction notices. Other errors were identified before results were released. In those cases, action could be taken before the final outcome reached students.

But not everything was caught early enough.

That shows the difference between having a correction process and having a strong correction process.

A useful correction process needs to answer practical questions:

Who can flag an error?

Where is the error logged?

Who owns the response?

Which exam paper, question, table, mark scheme, or result is affected?

How is the correction communicated?

How is the mark scheme updated?

How is the final result checked?

How does the organisation make sure the same type of error does not repeat?

The same applies outside education. If your team finds an error in a dataset, report, public submission matrix, AI output, or dashboard, the issue needs a correction route. Otherwise the correction sits in emails, tracked changes, or meeting notes and never becomes part of the system.

A source register for an evidence-heavy report can help with this because it gives the team one place to track where material came from, how it was used, and what review status it has.

5. The impact reached students

The most serious part is not the spreadsheet mismatch or the wrong table. It is the effect on students.

Students spend years preparing for these exams. A-level results can affect university offers, subject choices, bursaries, family planning, and confidence in the system.

When an exam paper contains errors, the student has to work out whether the problem is their understanding or the question itself. That creates anxiety during the exam. When errors affect grades, the problem becomes even more serious.

This is why high-stakes data workflows need stronger quality control than ordinary admin processes.

When the output affects people’s lives, the workflow behind that output needs to be defensible.

What should have happened

No assessment system can remove every possible error. But a stronger workflow can reduce the chance that errors reach students or affect grades.

Source data should have had a clear verification route

Any table, graph, calculation, or dataset used in an exam paper should be traceable back to the source file.

That route should show:

the original data

the spreadsheet or calculation used

the table created from it

the graph created from the table

the question using the graph or table

the mark scheme linked to the question

the review status of each item

This is the same logic used in source traceability work. The point is not to create admin for its own sake. The point is to make the final output checkable.

Spreadsheet and graph checks should have been part of QA

A spreadsheet can be technically correct at one point and wrong later if the connected output is not checked again.

For this kind of workflow, QA should include a direct table-to-graph check before final sign-off.

That check should not be a vague “review the paper” step. It should be a named review task with a clear owner, expected check, and status.

For organisations producing reports, dashboards, public consultation summaries, or donor outputs, the same principle applies. If a graph or table appears in the final output, someone should be able to show which data it came from and when it was checked.

The Source Traceability Risk Checker is useful for teams that want to test whether their current process can survive this kind of review.

Mark schemes should have been checked against the final paper

A mark scheme should not only be reviewed in isolation. It should be checked against the final version of the question paper.

That check matters because the paper and mark scheme can drift apart during edits.

A graph may be changed. A table may be corrected. A question may be reworded. A value may be adjusted. If the mark scheme is not checked against the final version, the answer logic may no longer fit the assessment material.

In evidence and reporting work, this is similar to checking whether a recommendation still matches the evidence table after edits. It is also similar to checking whether a report paragraph still reflects the source after review comments have changed the wording.

A findings-to-recommendations matrix helps avoid this problem by keeping findings, evidence, recommendations, and review notes connected.

Error escalation should have been easier to operate

Ofqual reportedly criticised Cambridge OCR for not having clear arrangements in place for schools to request mark adjustments because of errors.

That is a workflow issue.

When an error is found, the organisation needs a clear escalation route. It should not depend on ad hoc judgement, scattered emails, or unclear responsibility.

For any high-stakes process, the error route should include:

error ID

source of the error

affected material

affected users

action taken

owner

deadline

decision made

communication sent

final review status

A correction process is only useful if people know how to trigger it and the organisation can track what happened.

Final sign-off should have tested the whole chain

The final check should not only ask whether the document looks finished.

It should ask whether the chain holds.

For an exam paper, that means checking whether source data, graphs, tables, questions, mark schemes, correction notices, and final marks still align.

For a report, that means checking whether sources, quotes, claims, findings, recommendations, and final wording still align.

For a dashboard, it means checking whether raw records, formulas, filters, charts, and displayed totals still align.

For an AI knowledge base, it means checking whether answers can be traced back to approved source material. That is why a QA process for an AI knowledge base is needed before a team starts using it for real decisions.

What this shows about weak QA workflows

The Cambridge OCR physics exam errors show that QA is not the last step before publication.

QA is the operating system behind the whole information product.

The risks are familiar:

source material is copied without a check

spreadsheets are changed after graphs are created

mark schemes drift from questions

correction notices are not enough to solve the downstream issue

unclear escalation routes slow the response

final outputs are trusted because they look official

This does not only happen in education.

A donor report can contain a wrong figure. A public consultation summary can misclassify stakeholder input. A dashboard can display a total from the wrong filter. A policy briefing can cite the wrong source. An AI assistant can summarise the wrong document. A board pack can carry forward an old number.

The pattern is the same.

Information comes in. It gets processed. It becomes an output. People rely on it.

If the workflow between those points is weak, the final output can become a polished error.

How a stronger workflow could have reduced the risk

A stronger workflow would not guarantee that no exam paper ever contains a mistake.

But it could reduce the chance that a table error, spreadsheet mismatch, mark scheme issue, or correction problem reaches students or affects grades.

A stronger workflow would include:

source data tracking

spreadsheet version control

table-to-graph checks

paper-to-mark-scheme checks

named review owners

error logs

correction routes

affected-question tracking

communication records

final sign-off against source material

post-incident review that changes the workflow, not only the document

That is the practical lesson for any organisation producing high-stakes outputs.

The issue is not only whether the final document is well written. The issue is whether the route to that document can be checked.

If your team spends too much time searching, checking, and reconstructing how outputs were produced, the Search and Review Time Savings Calculator can help estimate the hidden time cost.

If the issue is repeated reporting work, the Reporting Bottleneck Cost Calculator can help show where manual review and correction work may already be costing the team.

What this means for research, policy, reporting, and evidence work

The Cambridge OCR case is about physics exams, but the same workflow risk appears in other evidence-heavy settings.

In public consultation work, submissions may be coded into themes. If the coding structure is weak, the final synthesis may misrepresent the public record.

In donor reporting, partner updates may be copied into a report. If source IDs and review notes are missing, the team may struggle to defend a finding.

In research synthesis, interview quotes may be moved into a report section. If the quote bank is not linked to source records, the reviewer may not know where the evidence came from.

In internal reporting, dashboards may pull from spreadsheets. If formulas, filters, and field names are not checked, the dashboard can create false confidence.

In AI-supported work, a custom assistant may produce a useful-looking answer. If the source material is messy, the answer may still be weak. That is why AI often gives weak answers when source material is messy.

The lesson is simple: the final output is not the whole system.

The system is the route from raw information to usable output.

Where my work fits

My work helps teams move from messy information to structured systems, faster analysis, clearer reports, and better decisions.

That can include:

intake workflows

source registers

evidence databases

source-linked evidence tables

quote banks

coding frameworks

findings matrices

QA checklists

AI knowledge base testing

report-ready outputs

handover notes

review workflows

The aim is not to add complexity. The aim is to make the path from source material to final output easier to check.

My Local Government White Paper evidence workflow case study shows how public submissions, claims coding, synthesis, drafting support, and review comments can stay connected instead of being split across folders, notes, tracked changes, and one-off summaries.

The UNICEF Zambia child poverty evidence workflow shows how 120 narrative case studies were structured into a spreadsheet-first evidence workflow with AI-assisted coding and quote-per-claim checks.

The UNICEF Palestine situation analysis case study shows how scattered qualitative material can be rebuilt into a retrieval, recommendation, and drafting workflow under deadline pressure.

Different context. Same principle.

The output is only as strong as the workflow behind it.

FAQ

What happened with Cambridge OCR physics exam errors?

Cambridge OCR was fined £270,000 by Ofqual after errors were identified in A-level and AS-level physics exam papers and mark schemes. Reporting said there were 12 errors between April and October 2025.

Some were caught before exams. Others were not. Some students were reportedly issued incorrect grades.

How much was Cambridge OCR fined?

Cambridge OCR was fined £270,000 by Ofqual.

What were the Cambridge OCR physics exam errors?

Reported errors included wrong data in a table, spreadsheet data not matching graphs, and mistakes in papers and mark schemes.

These are not only proofreading issues. They point to weaknesses in the way source data, spreadsheets, graphs, questions, mark schemes, and review processes were checked.

Were students affected by the Cambridge OCR physics errors?

Yes. Reporting said the failures caused anxiety for students and that some students were issued incorrect grades.

Why do exam paper errors matter?

Exam paper errors matter because students rely on the paper during a high-stakes assessment. If the question, table, graph, or mark scheme is wrong, the student may be disadvantaged or placed under unnecessary stress.

The final grade may also affect university applications, subject choices, and future plans.

Was this an AI failure?

No. This case was not about AI. It was about assessment content, source data, spreadsheet checking, mark scheme QA, review escalation, and correction workflows.

That makes it useful for organisations outside education because the same type of workflow failure can happen in reports, dashboards, public submissions analysis, donor reporting, and internal data systems.

What is the workflow lesson from the Cambridge OCR fine?

The lesson is that quality assurance needs to cover the whole route from source material to final output.

For an exam paper, that means source data, tables, graphs, questions, mark schemes, corrections, and final grades.

For a report or evidence workflow, it means source material, coded evidence, findings, recommendations, review notes, and final wording.

How can organisations avoid this kind of QA failure?

Organisations can reduce the risk by using clear source tracking, data dictionaries, version control, table-to-graph checks, review logs, named QA owners, correction workflows, and final sign-off against source material.

A good place to start is checking whether the team can trace final outputs back to the source records behind them.

What is source traceability?

Source traceability means being able to link a final output back to the material it came from.

In a report, that may mean linking a finding back to a quote, interview, case study, submission, spreadsheet row, or source document.

In an exam paper, it may mean linking a table, graph, question, and mark scheme back to the source data and review record.

Why do spreadsheets create QA risk?

Spreadsheets create QA risk when formulas, charts, ranges, tabs, or copied values change without proper review.

A graph may no longer match the source data. A table may be copied from an old version. A formula may produce a result that no one checks. That is why spreadsheet-based workflows need review rules, version control, and clear ownership.

How does this relate to evidence-heavy reports?

Evidence-heavy reports can fail in similar ways. A finding may not match the source. A recommendation may drift away from the evidence. A quote may be copied without a source ID. A chart may not match the spreadsheet.

That is why traceable evidence workflow support is useful before a report reaches final review.

Who can help audit a QA or evidence workflow?

I help teams audit and rebuild workflows that move from raw information to reports, dashboards, evidence tables, public submissions analysis, AI outputs, and decision support.

If your team needs to check whether its current process can survive review, correction, or public scrutiny, contact me here.

A useful next step

If your team produces reports, evidence tables, dashboards, submissions analysis, AI outputs, public-facing documents, or decision notes, ask three questions:

Can we trace every important output back to the source material?

Can we explain how the data, table, graph, finding, or recommendation was produced?

Can we detect and correct an error before people rely on the output?

If the answer is no, the issue is not only quality assurance. It is workflow design.

A weak workflow can become a correction exercise, a trust problem, a regulatory issue, or an expensive delivery failure. If your team needs to check whether its data, evidence, AI, or reporting workflow is strong enough, get in touch. I can help audit the current process, identify the main risk points, and build a clearer route from source material to final output.

Sources used in this guide

These sources were used to ground the facts of the case. The workflow analysis is my own.

Methodology and guidance

The Times report

Used for factual background and source context.

Read source

Process Failure Case Studies

Data Collection & Intake Systems

Collect useful, traceable data from the start through forms, fieldwork tools, public submission portals, partner reporting systems, calculators, and intake workflows.

Service fit

Relevant service fit

This article sits inside the same delivery work, service logic, and practical outcomes shown across the site.

Data Collection & Intake Systems

Collect useful, traceable data from the start through forms, fieldwork tools, public submission portals, partner reporting systems, calculators, and intake workflows.

Data Use, Reporting & Communication Systems

Use structured data in reports, dashboards, internal tools, public microsites, applications, presentations, annual reports, and decision-support workflows.

Traceable Evidence Workflow Support

Turn interviews, submissions, case studies, survey comments, documents, and field notes into coded evidence, quote banks, synthesis tables, findings, recommendations, and report-ready outputs.

Delivery examples

Related case studies

These delivery examples share the same service mix or workflow focus as the article you just read.

Next reads

Read the adjacent stage in the workflow.

Calculators

Relevant calculators

If this reflects a live bottleneck in your workflow, these tools can help you put rough numbers around it.

Softer next step

Not ready to send a brief yet?

Join the newsletter for practical notes on messy information, evidence workflows, source traceability, reporting pressure, and AI use that needs structure.

Join the newsletter Read the topic hub

Need help with a similar problem?

If this article reflects the kind of reporting, systems, or evidence challenge you are dealing with, send a short brief and I can help scope the right next step.

Who this guide is for

Key takeaways

The current situation

What happened

The main point

Where the workflow failed

1. Source data and question data were not checked tightly enough

2. Spreadsheet outputs and graphs did not stay aligned

3. The mark scheme depended on the same weak workflow

4. Corrections did not catch everything early enough

5. The impact reached students

What should have happened

Source data should have had a clear verification route

Spreadsheet and graph checks should have been part of QA

Mark schemes should have been checked against the final paper

Error escalation should have been easier to operate

Final sign-off should have tested the whole chain

What this shows about weak QA workflows

How a stronger workflow could have reduced the risk

What this means for research, policy, reporting, and evidence work

Where my work fits

FAQ

A useful next step

Sources used in this guide

Data Collection & Intake Systems

Relevant service fit

Related case studies

Policy Evidence Workflow for a Local Government White Paper

Child Poverty Evidence Workflow for a UNICEF Report Project in Zambia

Situation Analysis Recovery for a UNICEF Report Project in Palestine

Next reads

How to Build a Source Register for an Evidence-Heavy Report

How to Prepare a Findings-to-Recommendations Matrix

How to QA an AI Knowledge Base Before a Team Starts Using It

Relevant calculators

Reporting Bottleneck Cost

Search and Review Time Savings

Source Traceability Risk Checker

Not ready to send a brief yet?

Need help with a similar problem?