ABSTRACT
Word embeddings have made enormous inroads in recent years into a wide variety of text mining applications. In this extended abstract, we explore a word-embedding-based architecture for predicting the relevance of a role between two financial entities within the context of natural-language sentences. We propose a pooled approach that trains word embeddings on a collection of sentences using the skip-gram word2vec architecture. From these embeddings we derive context vectors, which are assigned one or more labels based on manual annotations. We train a machine learning classifier on the labeled context vectors and use the trained classifier to predict contextual role relevance on test data. Our approach serves as a strong minimal-expertise baseline for the task: it is simple and intuitive, uses open-source modules, requires little feature-engineering effort, and performs well across roles.