Mechanical and Aerospace Engineering Faculty Research & Creative Works

Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization

Md Moniruzzaman
Zhaozheng Yin, Missouri University of Science and TechnologyFollow
Zhihai He
Ruwen Qin
Ming-Chuan Leu, Missouri University of Science and TechnologyFollow

Abstract

The state-of-the-art of fully-supervised methods for temporal action localization from untrimmed videos has achieved impressive results. Yet, it remains unsatisfactory for the weakly-supervised temporal action localization, where only video-level action labels are given without the timestamp annotation on when the actions occur. The main reason comes from that, the weakly-supervised networks only focus on the highly discriminative frames, but there are some ambiguous frames in both background and action classes. The ambiguous frames in background class are very similar to the real actions, which may be treated as target actions and result in false positives. On the other hand, the ambiguous frames in action class which possibly contain action instances, are prone to be false negatives by the weakly-supervised networks and result in a coarse localization. To solve these problems, we introduce a novel weakly-supervised Action Completeness Modeling with Background Aware Networks (ACM-BANets). Our Background Aware Network (BANet) contains a weight-sharing two-branch architecture, with an action guided Background aware Temporal Attention Module (B-TAM) and an asymmetrical training strategy, to suppress both highly discriminative and ambiguous background frames to remove the false positives. Our action completeness modeling contains multiple BANets, and the BANets are forced to discover different but complementary action instances to completely localize the action instances in both highly discriminative and ambiguous action frames. In the i-th iteration, the i-th BANet discovers the discriminative features, which are then erased from the feature map. The partially-erased feature map is fed into the (i+1)-th BANet of the next iteration to force this BANet to discover discriminative features different from the i-th BANet. Evaluated on two challenging untrimmed video datasets, THUMOS14 and ActivityNet1.3, our approach outperforms all the current weakly-supervised methods for temporal action localization.

Recommended Citation

M. Moniruzzaman et al., "Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization," Proceedings of the 28th ACM International Conference on Multimedia (2020, Seattle, WA), pp. 2166 - 2174, Association for Computing Machinery (ACM), Oct 2020.

The definitive version is available at https://doi.org/10.1145/3394171.3413687

Meeting Name

28th ACM International Conference on Multimedia, MM 2020 (2020: Oct. 12-16, Seattle, WA)

Department(s)

Mechanical and Aerospace Engineering

Comments

National Science Foundation, Grant 1954548

Keywords and Phrases

action completeness modeling; background aware networks; temporal action localization; weakly-supervised learning

International Standard Book Number (ISBN)

978-145037988-5

Document Type

Article - Conference proceedings

Document Version

Citation

File Type

text

Language(s)

English

Rights

Publication Date

16 Oct 2020

Link to Full Text

COinS

Mechanical and Aerospace Engineering Faculty Research & Creative Works

Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization

Abstract

Recommended Citation

Meeting Name

Department(s)

Comments

Keywords and Phrases

International Standard Book Number (ISBN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Search

Browse

Faculty Gallery

Author Corner

Related Content

Useful Links

Article Locations

Mechanical and Aerospace Engineering Faculty Research & Creative Works

Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization

Author

Abstract

Recommended Citation

Meeting Name

Department(s)

Comments

Keywords and Phrases

International Standard Book Number (ISBN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Share

Search

Browse

Faculty Gallery

Author Corner

Related Content

Useful Links

Article Locations