DIGA620 Data Engineering (3 cr.)Prerequisite(s): DIGA605 or equivalent or consent of program director The course utilizes data processing requirements necessary to implement technology-based analytics. The course explores strengths and limitations of various data formats to make better decisions. The importance of structured and unstructured data formats as well as performing methods of data extraction, transformation, and loading are covered. Data wrangling methodologies explore constructing custom data pipelines to support efficient analysis. These methods include cleaning, filtering, standardizing, and categorizing data. Processes to review data for accuracy, consistency, and completeness are covered as well as techniques to mitigate error and improve data integrity.
Upon completion of the course students are expected to be able to do the following:
- Perform extract, transform, and load (ETL) processes using structure and unstructured data formats.
- Assess data for error and implement techniques to improve data integrity.
- Determine appropriate data formats for given situations.
- Design and document processes for converting raw data into a product suitable for analysis.
Add to Portfolio (opens a new window)
|