Concept help - Data Set
Table of Contents
- Help
- Available fields
-
Custom fields
- Asset Type
- Asset Status
- Security Classification
- Data Sensitivity Type
- Data Owner
- Data Steward
- Subject Matter Expert 1
- Subject Matter Expert 2
- Subject Matter Expert 3
- External Custodian
- Purpose
- Data Quality Statement
- Data Notes
- Data Collection Details
- Data Sharing Details
- Legal Authority
- Permitted Primary Purpose
- Permitted Secondary Purpose
- Data Access Details
- Publications and Outputs
- Asset Documentation
- Connected Systems
- Keyword
- Alert Message
- Admin Only
- Data Custodian
- Official Definition
A a dataset in DCAT is defined as a "collection of data, published or curated by a single agent, and available for access or download in one or more formats". A dataset does not have to be available as a downloadable file. For example, a dataset that is available via an API can be defined as an instance of dcat:Dataset and the API can be defined as an instance of dcat:Distribution. DCAT itself does not define properties specific to APIs description. These are considered out of the scope of this version of the vocabulary. Nevertheless, this can be defined as a profile of the DCAT vocabulary.
Fields available on this metadata type
Field | ISO definition |
---|---|
Name | The primary name used for human identification purposes. |
Definition | Representation of a concept by a descriptive statement which serves to differentiate it from related concepts. (3.2.39) |
Is Federated | |
Is Not Federable | |
Version | Unique version identifier of this metadata item. |
References | Significant documents that contributed to the development of the metadata item which were not the direct source for the metadata content. |
Origin | The source (e.g. document, project, discipline or model) for the item (8.1.2.2.3.5) |
Comments | Descriptive comments about the metadata item (8.1.2.2.3.4) |
Deleted | The date after which the item has been soft deleted and is no longer visible in the registry |
License | Information about the license document under which the dataset is made available. |
Rights | Information about rights held in and over the dataset. |
Release Date | Date of formal publication of the dataset. |
Modification Date | Most recent date on which the dataset was changed, updated or modified. |
Frequency | The frequency at which dataset is published. |
Spatial Coverage | Spatial or geographic coverage of the dataset. |
Temporal Coverage | The temporal or time period that the dataset covers. |
Catalog | An entity responsible for making the dataset available. |
Landing Page | A Web page that can be navigated to in a Web browser to gain access to the dataset, its distributions and/or additional information |
Contact Point | Relevant contact information for the Dataset. |
Conforming Specification | An established standard to which the described resource conforms. |
Item Base |
Custom Fields
Field | Short definition | Long definition | |||
---|---|---|---|---|---|
Asset Type | [Core] The type of data asset being described |
Information may be held as different content types. Information assets should be identified by type to enable assets to be assessed and planned for, as different content types will differ in business impact and value.
|
|||
Asset Status | This field indicates the status of the underlying data asset. |
This is does not refer to the status of the data asset registration within the IAR. The meaning of each status is as follows:
|
|||
Security Classification | This field is used to indicate the security classification of the dataset, helping to identify the level of access and protection required. Please select the appropriate classification label to ensure that users understand how the data in the dataset should be handled in accordance with NSW Government policies. |
The 'Security Classification' field records the NSW Government's classification label that has been deemed appropriate for the data in this dataset. These labels include the NSW Government's Dissemination Limiting Markers (DLM). This classification is important for users to ensure that sensitive information is protected and shared in compliance with NSW Government policies. When selecting a classification, consider the content of the data and its potential impact if disclosed. For more information on determining an appropriate classification, see the NSW Government Information Classification, Labelling and Handling Guidelines - Sensitive Information. |
|||
Data Sensitivity Type | This field indicates the sensitivity of the data concerning health and personal information. Please select the appropriate option to specify whether the asset contains health information, personal information, both, or neither. |
This is used to categorise the sensitivity of the data in relation to health and personal information. It helps ensure that appropriate handling and protection measures are applied according to the nature of the data. What is personal information? The Privacy and Personal Information Protection Act 1998 (PPIPA) provides for the protection of personal information held by government agencies. Personal Information is information or an opinion (including information or an opinion forming part of a database and whether or not recorded in a material form) about a person whose identity is apparent or can reasonably be ascertained from the information or opinion. It is not restricted to information that clearly identifies a person but may include information which leads to the identification of an individual when considered in association with other available information. It covers information held in paper or electronic records and extends to images, body samples and biometric data such as fingerprints. There are a number of exceptions to the definition of personal information. Those most relevant to information held by the department are information about:
What is health information? The Health Records and Information Privacy Act 2002 (HRIPA) regulates the collection and handling of health information by public sector agencies and private organisations. Health information is personal information that is information or an opinion about an individual’s physical or mental health or disability or the provision of a health service to an individual. It includes personal information about an individual collected in connection with the donation of body parts and genetic information collected in providing a health service that is or could be predictive of the health of that individual or a genetic relative of the individual. It includes healthcare identifiers. The HRIPA defines personal information on the same terms as the PPIPA with substantially the same exceptions as that Act. Health information, therefore, does not include information about:
|
|||
Data Owner | The role title of the position accountable for the asset within the organisation. |
|
|||
Data Steward | The role title of the person nominated by the Data Owner to be responsible for the operational management of the asset. |
The Data Steward has detailed and expert knowledge of their data and provides advice on appropriate use and interpretation. Where there is more than one data steward, include only the primary data steward role title in this field. Do not include the person's name. |
|||
Subject Matter Expert 1 | Provide the position or name of the Subject Matter Expert. Usually advised by the Steward. | ||||
Subject Matter Expert 2 | Use this field where there is more than one subject matter expert for the asset. | ||||
Subject Matter Expert 3 | Use this field where there is more than one subject matter expert for the asset. | ||||
External Custodian | Where the data is sourced from an external agency, list the agency who owns the data, and preferred contact details. | ||||
Purpose | A descriptive summary of the intentions with which the asset was developed. |
A descriptive summary of the intentions which the data asset was developed and proposed to be used for. This is why the data was collected in the first place, including what the data is used for from a business perspective. Please do not repeat the information in the description. See below for suggested text for this field.
|
|||
Data Quality Statement | A link to the data quality statement |
A link to the data quality statement. This field is created by the Department of Education. |
|||
Data Notes | Use this field to provide additional context and insights about the dataset. Include any limitations, considerations, or relevant information that can help users understand the dataset's utility and constraints. |
This field is intended for capturing essential context and considerations regarding the dataset that may not be covered in other fields. Use this space to outline any limitations or restrictions on the use and analysis of the data, as well as any important details that users should be aware of to fully understand its applicability. The information could include:
Providing comprehensive notes in this field will help users make informed decisions about the dataset's relevance and suitability for their purposes, ultimately enhancing the data's value and usability. See below for examples:
|
|||
Data Collection Details | This field is used to record how the data is collected. Include information that will help the user understand how the data was collected, including the methods, systems, or processes used to gather the data. |
Use this field to capture information about how the dataset is collected. If the collection details are available elsewhere, use this field to provide a link to these details. These details may include specific methods used (e.g., surveys, interviews, annual collections), the systems employed for data collection (e.g., databases, software applications), and the processes followed to collect the data. Providing detailed information on data collection is important for understanding the dataset's context and quality. When filling out this field, be as descriptive as possible to give users a clear understanding of the data collection approach and its implications for analysis and use. |
|||
Data Sharing Details | Capture information about any agreements related to the sharing of data within the Department or with external organisations/agencies (both incoming and outgoing data). Please specify any details regarding the conditions under which the data is shared with external parties, and any links or references to the data sharing agreement. |
Use this field to document any agreements or arrangements governing the sharing of the data with external organisations or agencies. This includes any contracts, data sharing agreements, ministerial directives, or memorandum of understandings (MoUs) that outline the terms for sharing the data with external parties. In this field, you should provide:
If the data is shared externally, the suggested text is:
If the data is not shared internally or externally, the suggested text is:
|
|||
Legal Authority | The applicable legal authorities under which the organisation is permitted to collect, create, receive, use, or disclose the data. Please specify the relevant legislation, policies, or agreements that govern the handling of this data. |
This 'Legal Authority' field captures all legal mandates pertaining to the collection, creation, receipt, use, or disclosure of the data asset. See below for suggested text for this field.
|
|||
Permitted Primary Purpose | This field is used to capture the permitted primary purpose for which the personal information was collected. |
This relates to ss 17 & 18 of the (NSW) PPIPA and Sch 1, ss 10 & 11 of the HRIPA. (Limits on use and disclosure of personal information). See below for the suggested format for this field.
|
|||
Permitted Secondary Purpose | A secondary purpose is directly related to the purpose for which personal information was collected. Where appropriate, use the suggested text below to populate this field. |
If the data does not contain health information or personal information, include the below text:
|
|||
Data Access Details | Use this field to provide information on how users can access the data, including necessary links, request procedures, and general access rules. |
This field provides users with information on how to access the data, including any existing publications or reports that may meet their needs prior to making a data request.
|
|||
Publications and Outputs | Use this field to record the process or conditions required for any release of publications or outputs from using the data. |
“Publication” refers to any method that will distribute data or information gained from the data outside the immediate project/work team. Methods may include graphics, tables, presentations, reports, academic journals, and cabinet submissions by any media. See below for suggested text for this field.
|
|||
Asset Documentation | This field contains links or references to file locations that provide documentation related to administration and management of the dataset. |
Included information may include TRIM folder links, Confluence links, approval records, briefs, metadata documentation or any other relevant materials. Providing this documentation allows internal users to easily find and locate related information about the asset, reducing the risk of documentation being lost and providing additional context. |
|||
Connected Systems | Use this field to record the systems that collect, access, and use the data. Include any relevant systems that interact with or rely on this dataset. |
This field documents the systems that collect, access, and use the data. This information is important for understanding the data ecosystem and how different systems interact with the dataset. Providing information about connected systems helps users understand the context in which the data is used and the potential implications for data management and sharing. |
|||
Keyword | Word(s) or tags that describe the data asset's content. |
These word(s) or terms describe the topic(s) covered by the data asset. It answers the question “what is this data asset about?” and supports data discovery. When selecting keywords, consider what search terms your users may choose when searching for the data asset. Where multiple keywords apply, separate the terms with a comma ‘,’. |
|||
Alert Message | Use this field to highlight any important information or updates that users need to be aware of. This message should generally be displayed at the top of the item page. |
This alert serves as a key communication tool to inform users about important information, updates, or notices related to the item. It can be used to convey critical messages that users need to be aware of. Possible reasons to use the alert include:
|
|||
Admin Only | This field is only to be used by administrators; the information is not to be shared to public. This may include metadata working files, TRIM links to relevant information. This field is added by the Department of Education. | ||||
Data Custodian | (DoE FIELD TO BE POSSIBLY DECOMMISSIONED) [Core] The custodian(s) of the data asset. |
Official Definition
A representation of a dataset in a catalog. Data Catalog Vocabulary (DCAT): 5.3 Class: Dataset