Elmvis+: Fast Nonlinear Visualization Technique based on Cosine Distance and Extreme Learning Machines

Abstract

This Paper Presents a Fast Algorithm and an Accelerated Toolbox. 11https://github.com/akusok/elmvis for Data Visualization. the Visualization is Stated as an Assignment Problem between Data Samples and the Same Number of Given Visualization Points. the Mapping Function is Approximated by an Extreme Learning Machine, Which Provides an Error for a Current Assignment. This Work Presents a New Mathematical Formulation of the Error Function based on Cosine Similarity. It Provides a Closed Form Equation for a Change of Error for Exchanging Assignments between Two Random Samples (Called a Swap), and an Extreme Speed-Up over the Original Method Even for a Very Large Corpus Like the Mnist Handwritten Digits Dataset. the Method Starts from Random Assignment, and Continues in a Greedy Optimization Algorithm by Randomly Swapping Pairs of Samples, Keeping the Swaps that Reduce the Error. the Toolbox Speed Reaches a Million of Swaps Per Second, and Thousands of Model Updates Per Second for Successful Swaps in Gpu Implementation, Even for Very Large Dataset Like Mnist Handwritten Digits.

Department(s)

Engineering Management and Systems Engineering

Keywords and Phrases

Big Data; Cosine Distance; Extreme Learning Machines; Nonlinear Dimensionality Reduction; Projection; Visualization

International Standard Serial Number (ISSN)

1872-8286; 0925-2312

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

© 2024 Elsevier, All rights reserved.

Publication Date

12 Sep 2016

Share

 
COinS