An excerpt from the Australian Medical Council’s 2023 annual report
Working with admin data
Questions to consider
What do the variables represent?
i.e. do the measure what you think they measure.
How were they collected?
Who does the data entry, how might this affect the data.
Who is represented in the data?
Is it representative of your target population?
What are the contexts affect the data?
Policy, legislation, other changes over time.
Data context
Bilson, A., Cant, R.L., Harries, M. and Thorpe, D.H., 2017. Accounting for the increase of children in care in Western Australia: What can a client information system tell us?. Child Abuse & Neglect, 72, pp.291-300.
Data dictionary
Describes the data contained within a dataset, such as:
Variable names
Descriptions
Values they can take on
Other things to note
If a dataset does not have a data dictionary, consider reaching out to the custodian if something is not clear.