Withdraw
Loading…
Video representation learning for vision-driven activity analysis in construction
Torres Calderon, Wilfredo
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/115763
Description
- Title
- Video representation learning for vision-driven activity analysis in construction
- Author(s)
- Torres Calderon, Wilfredo
- Issue Date
- 2022-04-15
- Director of Research (if dissertation) or Advisor (if thesis)
- Golparvar-Fard, Mani
- Doctoral Committee Chair(s)
- Golparvar-Fard, Mani
- Committee Member(s)
- Hoiem, Derek
- El-Rayes, Khaled
- Liu, Liang
- El-Gohary, Nora
- Department of Study
- Civil & Environmental Eng
- Discipline
- Civil Engineering
- Degree Granting Institution
- University of Illinois at Urbana-Champaign
- Degree Name
- Ph.D.
- Degree Level
- Dissertation
- Keyword(s)
- Construction Management
- Computer Vision
- Productivity Analysis
- Deep Learning
- Abstract
- Labor and equipment are two of the most important and costly resources that directly impact productivity and profits in a construction project. Many controllable factors such as the site layout and sequence of work operations can influence the "direct-work" or "tool time" of workers and equipment. As such, construction managers require a comprehensive and continuous understanding of the on-site activities to identify productivity inhibitors that affect the operation of field resources, devise a plan to reduce these issues, and continuously measure improvements resulting from these changes. Activity analysis --the continuous process of measuring and improving the amount of time craft workers and equipment spend on actual construction-- can offer a detailed description of such field activities to the management. However, the measurement of direct-work rates has traditionally involved trained observers walking along the randomly selected routes on the site to classify and record each observed construction worker and equipment into different activity categories by visual observations. Despite the effectiveness of the results, the large size of manual observations needed to guarantee the statistical accuracy of workface assessment and the cost associated with the process can easily discourage project teams from conducting activity analysis. Other factors such as the distance limits to construction workers and equipment, the observers' bias and fatigue, as well as the observers' visual judgments due to overproductiveness phenomenon caused by construction workers under observations or instantaneous reaction of the observer to categorize work activities can lead to erroneous data and thus negatively impact the baselines for productivity improvements. Over the past two decades, a large number of time-lapse and video cameras have been installed on construction sites to offer remote progress monitoring and safety analysis. The ubiquitous nature of these cameras and the improved quality of their video feeds provide an opportunity to develop systems that can automatically offer activity analysis to the management and allow them to focus on planning and implementing productivity improvements to work sequences. To do so, several computer vision techniques have been developed in recent years to detect, track, and analyze motion trajectories and activities of the workers and equipment from these video feeds. While computer vision-based detection and tracking techniques have significantly matured, analyzing worker and equipment activities has remained largely an unsolved problem. Interpretation, identification, and joint-optimization of work activities in conjunction with ergonomics, require approaches that address the following questions: (a) what are the most appropriate atomic primitive representations for worker and equipment activities, (b) how to combine these primitive representations, particularly in the context of equipment-worker-tool interactions and site-layout context to produce complex activities,(c) what are the required invariances for machine learning-based inference algorithms, (d) how to build computational models and multi-objective optimizations for each of these inferences to enable joint productivity-ergonomic assessments, and lastly (e) how a significant amount of ground truth data can be generated to efficiently and effectively train these machine learning models. This dissertation provides a broad overview and in-depth discussion of these issues. Specifically, it introduces a new collection of deep learning methodologies and optimization algorithms to leverage real-world and synthetic visual data to offer activity analysis for both equipment and worker operations. In these algorithms, the spatio-temporal characterization of construction activities is described in terms of appearance, motion, pose reconstructions, and agent-tool-environment interactions. A substantial body of experimental results is presented, demonstrating how these distinguishable data representations allow for temporally coherent interpretations of the worker and equipment activities to produce reliable predictions for workface assessment and productivity improvements. In addition, these results illustrate that reasoning about atomic activities, ergonomic assessment, and accumulated worker-joint movements allows for finding optimal activity execution sequences per worker. It is shown that the actionable insight from these assessments can offer management an opportunity to review and revise work sequences and site layout such that productivity is maximized without negatively impacting worker health and safety in site operations. The merits and drawbacks of these strategies are examined in-depth concerning the practice of activity analysis to demonstrate their potential for minimizing the time needed for workface assessment and thus, allowing professionals to focus their time on the more important task of root-cause analysis and investigating alternatives for performance improvement in field operations.
- Graduation Semester
- 2022-05
- Type of Resource
- Thesis
- Copyright and License Information
- Copyright 2022 Wilfredo Torres Calderon
Owning Collections
Graduate Dissertations and Theses at Illinois PRIMARY
Graduate Theses and Dissertations at IllinoisManage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…