subscribe to arXiv mailings

doi 10.1109/ICSC52841.2022.00012

Impact of Stop Sets on Stopping Active Learning for Text Classification

Authors: Luke Kurlandski, Michael Bloodgood

Abstract: Active learning is an increasingly important branch of machine learning and a powerful technique for natural language processing. The main advantage of active learning is its potential to reduce the amount of labeled data needed to learn high-performing models. A vital aspect of an effective active learning algorithm is the determination of when to stop obtaining additional labeled data. Several l… ▽ More Active learning is an increasingly important branch of machine learning and a powerful technique for natural language processing. The main advantage of active learning is its potential to reduce the amount of labeled data needed to learn high-performing models. A vital aspect of an effective active learning algorithm is the determination of when to stop obtaining additional labeled data. Several leading state-of-the-art stopping methods use a stop set to help make this decision. However, there has been relatively less attention given to the choice of stop set than to the stopping algorithms that are applied on the stop set. Different choices of stop sets can lead to significant differences in stopping method performance. We investigate the impact of different stop set choices on different stopping methods. This paper shows the choice of the stop set can have a significant impact on the performance of stopping methods and the impact is different for stability-based methods from that on confidence-based methods. Furthermore, the unbiased representative stop sets suggested by original authors of methods work better than the systematically biased stop sets used in recently published work, and stopping methods based on stabilizing predictions have stronger performance than confidence-based stopping methods when unbiased representative stop sets are used. We provide the largest quantity of experimental results on the impact of stop sets to date. The findings are important for helping to illuminate the impact of this important aspect of stopping methods that has been under-considered in recently published work and that can have a large practical impact on the performance of stopping methods for important semantic computing applications such as technology assisted review and text classification more broadly. △ Less

Submitted 2 April, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

Comments: 8 pages, 3 tables, 1 figure; published in Proceedings of the IEEE 16th International Conference on Semantic Computing (ICSC), pages 25-32, January 2022. IEEE

ACM Class: H.3.3; I.2.6; I.2.7; I.5.4

Journal ref: In Proceedings of the 2022 IEEE 16th International Conference on Semantic Computing (ICSC), pages 25-32, January 2022. IEEE

arXiv:2001.10337 [pdf, other]

doi 10.1109/ICSC.2020.00018

Early Forecasting of Text Classification Accuracy and F-Measure with Active Learning

Authors: Thomas Orth, Michael Bloodgood

Abstract: When creating text classification systems, one of the major bottlenecks is the annotation of training data. Active learning has been proposed to address this bottleneck using stopping methods to minimize the cost of data annotation. An important capability for improving the utility of stopping methods is to effectively forecast the performance of the text classification models. Forecasting can be… ▽ More When creating text classification systems, one of the major bottlenecks is the annotation of training data. Active learning has been proposed to address this bottleneck using stopping methods to minimize the cost of data annotation. An important capability for improving the utility of stopping methods is to effectively forecast the performance of the text classification models. Forecasting can be done through the use of logarithmic models regressed on some portion of the data as learning is progressing. A critical unexplored question is what portion of the data is needed for accurate forecasting. There is a tension, where it is desirable to use less data so that the forecast can be made earlier, which is more useful, versus it being desirable to use more data, so that the forecast can be more accurate. We find that when using active learning it is even more important to generate forecasts earlier so as to make them more useful and not waste annotation effort. We investigate the difference in forecasting difficulty when using accuracy and F-measure as the text classification system performance metrics and we find that F-measure is more difficult to forecast. We conduct experiments on seven text classification datasets in different semantic domains with different characteristics and with three different base machine learning algorithms. We find that forecasting is easiest for decision tree learning, moderate for Support Vector Machines, and most difficult for neural networks. △ Less

Submitted 11 April, 2020; v1 submitted 20 January, 2020; originally announced January 2020.

Comments: 8 pages, 9 figures, 2 tables; published in Proceedings of the IEEE 14th International Conference on Semantic Computing (ICSC), San Diego, CA, USA, pages 77-84, February 2020

ACM Class: H.3.3; I.2.6; I.2.7; I.5.4

Journal ref: In Proceedings of the 2020 IEEE 14th International Conference on Semantic Computing (ICSC), pages 77-84, San Diego, CA, USA, February 2020. IEEE

arXiv:1903.06050 [pdf]

doi 10.1021/acsnano.9b02870

Electric Switching of the Charge-Density-Wave and Normal Metallic Phases in Tantalum Disulfide Thin-Film Devices

Authors: A. Geremew, S. Rumyantsev, F. Kargar, B. Debnath, A. Nosek, M. Bloodgood, M. Bockrath, T. Salguero, R. K. Lake, A. A. Balandin

Abstract: We report on switching among three charge-density-wave phases - commensurate, nearly commensurate, incommensurate - and the high-temperature normal metallic phase in thin-film 1T-TaS2 devices induced by application of an in-plane electric field. The electric switching among all phases has been achieved over a wide temperature range, from 77 K to 400 K. The low-frequency electronic noise spectrosco… ▽ More We report on switching among three charge-density-wave phases - commensurate, nearly commensurate, incommensurate - and the high-temperature normal metallic phase in thin-film 1T-TaS2 devices induced by application of an in-plane electric field. The electric switching among all phases has been achieved over a wide temperature range, from 77 K to 400 K. The low-frequency electronic noise spectroscopy has been used as an effective tool for monitoring the transitions, particularly the switching from the incommensurate charge-density-wave phase to the normal metal phase. The noise spectral density exhibits sharp increases at the phase transition points, which correspond to the step-like changes in resistivity. Assignment of the phases is consistent with low-field resistivity measurements over the temperature range from 77 K to 600 K. Analysis of the experimental data and calculations of heat dissipation suggest that Joule heating plays a dominant role in the electric-field induced transitions in the tested 1T-TaS2 devices on Si/SiO2 substrates. The possibility of electrical switching among four different phases of 1T-TaS2 is a promising step toward nanoscale device applications. The results also demonstrate the potential of noise spectroscopy for investigating and identifying phase transitions in materials. △ Less

Submitted 14 March, 2019; originally announced March 2019.

Comments: 32 pages, 7 figures

Journal ref: ACS Nano, 13, 7231 (2019)

arXiv:1901.09126 [pdf, ps, other]

doi 10.1109/ICOSC.2019.8665546

The Use of Unlabeled Data versus Labeled Data for Stopping Active Learning for Text Classification

Authors: Garrett Beatty, Ethan Kochis, Michael Bloodgood

Abstract: Annotation of training data is the major bottleneck in the creation of text classification systems. Active learning is a commonly used technique to reduce the amount of training data one needs to label. A crucial aspect of active learning is determining when to stop labeling data. Three potential sources for informing when to stop active learning are an additional labeled set of data, an unlabeled… ▽ More Annotation of training data is the major bottleneck in the creation of text classification systems. Active learning is a commonly used technique to reduce the amount of training data one needs to label. A crucial aspect of active learning is determining when to stop labeling data. Three potential sources for informing when to stop active learning are an additional labeled set of data, an unlabeled set of data, and the training data that is labeled during the process of active learning. To date, no one has compared and contrasted the advantages and disadvantages of stopping methods based on these three information sources. We find that stopping methods that use unlabeled data are more effective than methods that use labeled data. △ Less

Submitted 22 April, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

Comments: 8 pages, 4 figures, 3 tables; published in Proceedings of the IEEE 13th International Conference on Semantic Computing (ICSC), Newport Beach, CA, USA, pages 287-294, January 2019

ACM Class: H.3.3; I.2.6; I.2.7; I.5.4

Journal ref: In Proceedings of the 2019 IEEE 13th International Conference on Semantic Computing (ICSC), pages 287-294, Newport Beach, CA, USA, January 2019. IEEE

arXiv:1901.09118 [pdf, ps, other]

doi 10.1109/ICOSC.2019.8665646

Stopping Active Learning based on Predicted Change of F Measure for Text Classification

Authors: Michael Altschuler, Michael Bloodgood

Abstract: During active learning, an effective stopping method allows users to limit the number of annotations, which is cost effective. In this paper, a new stopping method called Predicted Change of F Measure will be introduced that attempts to provide the users an estimate of how much performance of the model is changing at each iteration. This stopping method can be applied with any base learner. This m… ▽ More During active learning, an effective stopping method allows users to limit the number of annotations, which is cost effective. In this paper, a new stopping method called Predicted Change of F Measure will be introduced that attempts to provide the users an estimate of how much performance of the model is changing at each iteration. This stopping method can be applied with any base learner. This method is useful for reducing the data annotation bottleneck encountered when building text classification systems. △ Less

Submitted 22 April, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

Comments: 8 pages, 12 tables; published in Proceedings of the 2019 IEEE 13th International Conference on Semantic Computing (ICSC), Newport Beach, CA, USA, pages 47-54, January 2019

ACM Class: H.3.3; I.2.6; I.2.7; I.5.4

Journal ref: In Proceedings of the 2019 IEEE 13th International Conference on Semantic Computing (ICSC), pages 47-54, Newport Beach, CA, USA, January 2019. IEEE

arXiv:1901.01475 [pdf]

doi 10.7567/1882-0786/ab0397

Low-Frequency Noise Spectroscopy of Charge-Density-Wave Phase Transitions in Vertical Quasi-2D Devices

Authors: Ruben Salgado, Amirmahdi Mohammadzadeh, Fariborz Kargar, Adane Geremew, Chun-Yu Huang, Matthew A. Bloodgood, Sergey Rumyantsev, Tina T. Salguero, Alexander A. Balandin

Abstract: We report results regarding the electron transport in vertical quasi-2D layered 1T-TaS2 charge-density-wave devices. The low-frequency noise spectroscopy was used as a tool to study changes in the cross-plane electrical characteristics of the quasi-2D material below room temperature. The noise spectral density revealed strong peaks - changing by more than an order-of-magnitude - at the temperature… ▽ More We report results regarding the electron transport in vertical quasi-2D layered 1T-TaS2 charge-density-wave devices. The low-frequency noise spectroscopy was used as a tool to study changes in the cross-plane electrical characteristics of the quasi-2D material below room temperature. The noise spectral density revealed strong peaks - changing by more than an order-of-magnitude - at the temperatures closely matching the electrical resistance steps. Some of the noise peaks appeared below the temperature of the commensurate to nearly-commensurate charge-density-wave transition, possibly indicating the presence of the debated "hidden" phase transitions. These results confirm the potential of the noise spectroscopy for investigations of electron transport and phase transitions in novel materials. △ Less

Submitted 5 January, 2019; originally announced January 2019.

Comments: 16 pages; 5 figures

Journal ref: Applied Physics Express, 12, 037001 (2019)

arXiv:1901.00551 [pdf]

doi 10.1039/C9NR01614G

Proton-Irradiation-Immune Electronics Implemented with Two-Dimensional Charge-Density-Wave Devices

Authors: A. Geremew, F. Kargar, E. X. Zhang, S. E. Zhao, E. Aytan, M. A. Bloodgood, T. T. Salguero, S. Rumyantsev, A. Fedoseyev, D. M. Fleetwood, A. A. Balandin

Abstract: Proton radiation damage is an important failure mechanism for electronic devices in near-Earth orbits, deep space and high energy physics facilities. Protons can cause ionizing damage and atomic displacements, resulting in device degradation and malfunction. Shielding of electronics increases the weight and cost of the systems but does not eliminate destructive single events produced by energetic… ▽ More Proton radiation damage is an important failure mechanism for electronic devices in near-Earth orbits, deep space and high energy physics facilities. Protons can cause ionizing damage and atomic displacements, resulting in device degradation and malfunction. Shielding of electronics increases the weight and cost of the systems but does not eliminate destructive single events produced by energetic protons. Modern electronics based on semiconductors - even those specially designed for radiation hardness - remain highly susceptible to proton damage. Here we demonstrate that room temperature (RT) charge-density-wave (CDW) devices with quasi-two-dimensional (2D) 1T-TaS2 channels show remarkable immunity to bombardment with 1.8 MeV protons to a fluence of at least 10^14 H+cm^2. Current-voltage I-V characteristics of these 2D CDW devices do not change as a result of proton irradiation, in striking contrast to most conventional semiconductor devices or other 2D devices. Only negligible changes are found in the low-frequency noise spectra. The radiation immunity of these "all-metallic" CDW devices can be attributed to their two-terminal design, quasi-2D nature of the active channel, and high concentration of charge carriers in the utilized CDW phases. Such devices, capable of operating over a wide temperature range, can constitute a crucial segment of future electronics for space, particle accelerator and other radiation environments. △ Less

Submitted 2 January, 2019; originally announced January 2019.

Comments: 18 pages, 2 display items

Journal ref: Nanoscale, 11, 8380 - 8386 (2019)

arXiv:1808.09618 [pdf]

Anomalous Characteristics of the Generation - Recombination Noise in Quasi-One-Dimensional Van der Waals Nanoribbons

Authors: Adane K. Geremew, Sergey Rumyantsev, Matthew A. Bloodgood, Tina T. Salguero, Alexander A. Balandin

Abstract: We describe the low-frequency current fluctuations, i.e. electronic noise, in quasi-one-dimensional ZrTe3 van der Waals nanoribbons, which have recently attracted attention owing to their extraordinary high current carrying capacity. Whereas the low-frequency noise spectral density reveals 1/f behavior near room temperature, it is dominated by the Lorentzian bulges of the generation - recombinatio… ▽ More We describe the low-frequency current fluctuations, i.e. electronic noise, in quasi-one-dimensional ZrTe3 van der Waals nanoribbons, which have recently attracted attention owing to their extraordinary high current carrying capacity. Whereas the low-frequency noise spectral density reveals 1/f behavior near room temperature, it is dominated by the Lorentzian bulges of the generation - recombination noise at low temperatures (f is the frequency). Unexpectedly, the corner frequency of the observed Lorentzian peaks shows strong sensitivity to the applied source - drain bias. This dependence on electric field can be explained by the Frenkel-Poole effect in the scenario where the voltage drop happens predominantly on the defects, which block the quasi-1D conduction channels. We also have found that the activation energy of the characteristic frequencies of the G-R noise in quasi-1D ZrTe3 is defined primarily by the temperature dependence of the capture cross-section of the defects rather than by their energy position. These results are important for the application of quasi-1D van der Waals materials in ultimately downscaled electronics. △ Less

Submitted 28 August, 2018; originally announced August 2018.

Comments: 22 pages; 7 figures

Journal ref: Nanoscale, 10, 42, 19749 (2018)

arXiv:1802.02536 [pdf]

doi 10.1021/acs.nanolett.8b00729

Low-Frequency Noise and Sliding of the Charge Density Waves in Two-Dimensional Materials

Authors: Guanxiong Liu, Sergey Rumyantsev, Matthew. A. Bloodgood, Tina T. Salguero, Alexander A. Balandin

Abstract: There has been a recent renewal of interest in charge-density-wave (CDW) phenomena, primarily driven by the emergence of two-dimensional (2D) layered CDW materials, such as 1T-TaS2, characterized by very high transition temperatures to CDW phases. In the extensively studied classical bulk CDW materials with quasi-1D crystal structure, the charge carrier transport exhibits intriguing sliding behavi… ▽ More There has been a recent renewal of interest in charge-density-wave (CDW) phenomena, primarily driven by the emergence of two-dimensional (2D) layered CDW materials, such as 1T-TaS2, characterized by very high transition temperatures to CDW phases. In the extensively studied classical bulk CDW materials with quasi-1D crystal structure, the charge carrier transport exhibits intriguing sliding behavior, which reveals itself in the frequency domain as "narrowband" and "broadband" noise. Despite the increasing attention on physics of 2D CDWs, there have been few reports of CDW sliding, specifically in quasi-2D rare-earth tritellurides and none on the noise in any of 2D CDW systems. Here we report the results of low-frequency noise (LFN) measurements on 1T-TaS2 thin films - archetypal 2D CDW systems, as they are driven from the nearly commensurate (NC) to incommensurate (IC) CDW phases by voltage and temperature stimuli. We have found that noise in 1T-TaS2 devices has two pronounced maxima at the bias voltages, which correspond to the onset of CDW sliding and the NC-to-IC phase transition. We observed unusual Lorentzian noise features and exceptionally strong noise dependence on electric bias and temperature. We argue that LFN in 2D CDW systems has unique physical origin, different from known fundamental noise types. The specifics of LFN in 2D CDW materials can be explained by invoking the concept of interacting discrete fluctuators in the NC-CDW phase. Noise spectroscopy can serve as a useful tool for understanding electronic transport phenomena in 2D CDW materials characterized by coexistence of different phases and strong CDW pinning. △ Less

Submitted 7 February, 2018; originally announced February 2018.

Comments: 18 pages; 3 figures

Journal ref: Nano Letters, 18, 3630 (2018)

arXiv:1801.07887 [pdf, other]

doi 10.1109/ICSC.2018.00059

Impact of Batch Size on Stopping Active Learning for Text Classification

Authors: Garrett Beatty, Ethan Kochis, Michael Bloodgood

Abstract: When using active learning, smaller batch sizes are typically more efficient from a learning efficiency perspective. However, in practice due to speed and human annotator considerations, the use of larger batch sizes is necessary. While past work has shown that larger batch sizes decrease learning efficiency from a learning curve perspective, it remains an open question how batch size impacts meth… ▽ More When using active learning, smaller batch sizes are typically more efficient from a learning efficiency perspective. However, in practice due to speed and human annotator considerations, the use of larger batch sizes is necessary. While past work has shown that larger batch sizes decrease learning efficiency from a learning curve perspective, it remains an open question how batch size impacts methods for stopping active learning. We find that large batch sizes degrade the performance of a leading stopping method over and above the degradation that results from reduced learning efficiency. We analyze this degradation and find that it can be mitigated by changing the window size parameter of how many past iterations of learning are taken into account when making the stopping decision. We find that when using larger batch sizes, stopping methods are more effective when smaller window sizes are used. △ Less

Submitted 16 May, 2018; v1 submitted 24 January, 2018; originally announced January 2018.

Comments: 2 pages, 1 table; published in Proceedings of the IEEE 12th International Conference on Semantic Computing (ICSC 2018), Laguna Hills, CA, USA, pages 306-307, January 2018

ACM Class: H.3.3; I.2.6; I.2.7; I.5.4

Journal ref: In Proceedings of the 2018 IEEE 12th International Conference on Semantic Computing (ICSC), pages 306-307, Laguna Hills, CA, USA, January 2018. IEEE

arXiv:1801.07875 [pdf, other]

doi 10.1109/ICSC.2018.00029

Support Vector Machine Active Learning Algorithms with Query-by-Committee versus Closest-to-Hyperplane Selection

Authors: Michael Bloodgood

Abstract: This paper investigates and evaluates support vector machine active learning algorithms for use with imbalanced datasets, which commonly arise in many applications such as information extraction applications. Algorithms based on closest-to-hyperplane selection and query-by-committee selection are combined with methods for addressing imbalance such as positive amplification based on prevalence stat… ▽ More This paper investigates and evaluates support vector machine active learning algorithms for use with imbalanced datasets, which commonly arise in many applications such as information extraction applications. Algorithms based on closest-to-hyperplane selection and query-by-committee selection are combined with methods for addressing imbalance such as positive amplification based on prevalence statistics from initial random samples. Three algorithms (ClosestPA, QBagPA, and QBoostPA) are presented and carefully evaluated on datasets for text classification and relation extraction. The ClosestPA algorithm is shown to consistently outperform the other two in a variety of ways and insights are provided as to why this is the case. △ Less

Submitted 16 May, 2018; v1 submitted 24 January, 2018; originally announced January 2018.

Comments: 8 pages, 7 figures, 3 tables; published in Proceedings of the IEEE 12th International Conference on Semantic Computing (ICSC 2018), Laguna Hills, CA, USA, pages 148-155, January 2018

ACM Class: H.3.3; I.2.6; I.2.7; I.5.4

Journal ref: In Proceedings of the 2018 IEEE 12th International Conference on Semantic Computing (ICSC), pages 148-155, Laguna Hills, CA, USA, January 2018. IEEE

arXiv:1712.01354 [pdf]

doi 10.1109/LED.2017.2763597

Total Ionizing Dose Effects on Threshold Switching in 1T-Tantalum Disulfide Charge-Density-Wave Devices

Authors: G. Liu, E. X. Zhang, C. D. Liang, M. A. Bloodgood, T. T. Salguero, D. M. Fleetwood, A. A. Balandin

Abstract: The 1T polytype of TaS2 exhibits voltage-triggered threshold switching as a result of a phase transition from nearly commensurate to incommensurate charge density wave states. Threshold switching, persistent above room temperature, can be utilized in a variety of electronic devices, e.g., voltage controlled oscillators. We evaluated the total-ionizing-dose response of thin film 1T-TaS2 at doses up… ▽ More The 1T polytype of TaS2 exhibits voltage-triggered threshold switching as a result of a phase transition from nearly commensurate to incommensurate charge density wave states. Threshold switching, persistent above room temperature, can be utilized in a variety of electronic devices, e.g., voltage controlled oscillators. We evaluated the total-ionizing-dose response of thin film 1T-TaS2 at doses up to 1 Mrad(SiO2). The threshold voltage changed by less than 2% after irradiation, with persistent self-sustained oscillations observed through the full irradiation sequence. The radiation hardness is attributed to the high intrinsic carrier concentration of 1T-TaS2 in both of the phases that lead to threshold switching. These results suggest that charge density wave devices, implemented with thin films of 1T-TaS2, are promising for applications in high radiation environments. △ Less

Submitted 18 October, 2017; originally announced December 2017.

Comments: 4 pages; 4 figures

Journal ref: EEE Electron Device Letters, 38, 1724 (2017)

arXiv:1706.01570 [pdf, other]

Acquisition of Translation Lexicons for Historically Unwritten Languages via Bridging Loanwords

Authors: Michael Bloodgood, Benjamin Strauss

Abstract: With the advent of informal electronic communications such as social media, colloquial languages that were historically unwritten are being written for the first time in heavily code-switched environments. We present a method for inducing portions of translation lexicons through the use of expert knowledge in these settings where there are approximately zero resources available other than a langua… ▽ More With the advent of informal electronic communications such as social media, colloquial languages that were historically unwritten are being written for the first time in heavily code-switched environments. We present a method for inducing portions of translation lexicons through the use of expert knowledge in these settings where there are approximately zero resources available other than a language informant, potentially not even large amounts of monolingual data. We investigate inducing a Moroccan Darija-English translation lexicon via French loanwords bridging into English and find that a useful lexicon is induced for human-assisted translation and statistical machine translation. △ Less

Submitted 20 August, 2017; v1 submitted 5 June, 2017; originally announced June 2017.

Comments: 5 pages, 1 figure, 1 table; published in the Proceedings of the 10th Workshop on Building and Using Comparable Corpora, pages 21-25, Vancouver, Canada, August 2017

ACM Class: I.2.7

Journal ref: In Proceedings of the 10th Workshop on Building and Using Comparable Corpora, pages 21-25, Vancouver, Canada, August 2017. Association for Computational Linguistics

arXiv:1704.07050 [pdf, other]

doi 10.18653/v1/P17-1181

Using Global Constraints and Reranking to Improve Cognates Detection

Authors: Michael Bloodgood, Benjamin Strauss

Abstract: Global constraints and reranking have not been used in cognates detection research to date. We propose methods for using global constraints by performing rescoring of the score matrices produced by state of the art cognates detection systems. Using global constraints to perform rescoring is complementary to state of the art methods for performing cognates detection and results in significant perfo… ▽ More Global constraints and reranking have not been used in cognates detection research to date. We propose methods for using global constraints by performing rescoring of the score matrices produced by state of the art cognates detection systems. Using global constraints to perform rescoring is complementary to state of the art methods for performing cognates detection and results in significant performance improvements beyond current state of the art performance on publicly available datasets with different language pairs and various conditions such as different levels of baseline state of the art performance and different data size conditions, including with more realistic large data size conditions than have been evaluated with in the past. △ Less

Submitted 19 August, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

Comments: 10 pages, 6 figures, 6 tables; published in the Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, pages 1983-1992, Vancouver, Canada, July 2017

ACM Class: I.2.6; I.2.7; I.5.1; I.5.4

Journal ref: In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, pages 1983-1992, Vancouver, Canada, July 2017. Association for Computational Linguistics

arXiv:1702.06216 [pdf, other]

doi 10.1109/ICSC.2017.75

Filtering Tweets for Social Unrest

Authors: Alan Mishler, Kevin Wonus, Wendy Chambers, Michael Bloodgood

Abstract: Since the events of the Arab Spring, there has been increased interest in using social media to anticipate social unrest. While efforts have been made toward automated unrest prediction, we focus on filtering the vast volume of tweets to identify tweets relevant to unrest, which can be provided to downstream users for further analysis. We train a supervised classifier that is able to label Arabic… ▽ More Since the events of the Arab Spring, there has been increased interest in using social media to anticipate social unrest. While efforts have been made toward automated unrest prediction, we focus on filtering the vast volume of tweets to identify tweets relevant to unrest, which can be provided to downstream users for further analysis. We train a supervised classifier that is able to label Arabic language tweets as relevant to unrest with high reliability. We examine the relationship between training data size and performance and investigate ways to optimize the model building process while minimizing cost. We also explore how confidence thresholds can be set to achieve desired levels of performance. △ Less

Submitted 1 April, 2017; v1 submitted 20 February, 2017; originally announced February 2017.

Comments: 7 pages, 8 figures, 3 tables; published in Proceedings of the 2017 IEEE 11th International Conference on Semantic Computing (ICSC), San Diego, CA, USA, pages 17-23, January 2017

ACM Class: H.3.3; I.2.6; I.2.7; I.5.4

Journal ref: In Proceedings of the 2017 IEEE 11th International Conference on Semantic Computing (ICSC), pages 17-23, San Diego, CA, USA, January 2017. IEEE

arXiv:1610.04891 [pdf]

doi 10.1021/acs.nanolett.6b04334

Low-Frequency Electronic Noise in Exfoliated Quasi-1D TaSe3 van Der Waals Nanowires

Authors: Guanxiong Liu, Sergey Rumyantsev, Matthew A. Bloodgood, Tina T. Salguero, Michael Shur, Alexander A. Balandin

Abstract: We report results of investigation of the low-frequency electronic excess noise in quasi-1D nanowires of TaSe3 capped with quasi-2D h-BN layers. Semi-metallic TaSe3 is a quasi-1D van der Waals material with exceptionally high breakdown current density. It was found that TaSe3 nanowires have lower levels of the normalized noise spectral density, compared to carbon nanotubes and graphene. The temper… ▽ More We report results of investigation of the low-frequency electronic excess noise in quasi-1D nanowires of TaSe3 capped with quasi-2D h-BN layers. Semi-metallic TaSe3 is a quasi-1D van der Waals material with exceptionally high breakdown current density. It was found that TaSe3 nanowires have lower levels of the normalized noise spectral density, compared to carbon nanotubes and graphene. The temperature-dependent measurements revealed that the low-frequency electronic 1/f noise becomes the 1/f^2-type as temperature increases to about 400 K, suggesting the onset of electromigration (f is the frequency). Using the Dutta- Horn random fluctuation model of the electronic noise in metals we determined that the noise activation energy for quasi-1D TaSe3 nanowires is approximately E_P=1.0 eV. In the framework of the empirical noise model for metallic interconnects, the extracted activation energy, related to electromigration, is E_A=0.88 eV, consistent with that for Cu and Al interconnects. Our results shed light on the physical mechanism of low-frequency 1/f noise in quasi-1D van der Waals semi-metals and suggest that such material systems have potential for ultimately downscaled local interconnect applications. △ Less

Submitted 16 October, 2016; originally announced October 2016.

Comments: 22 pages; 6 figures

Journal ref: Nano Letters, 17, 377 (2017)

arXiv:1604.03093 [pdf]

Breakdown Current Density in BN-Capped Quasi-1D TaSe3 Metallic Nanowires: Prospects of Interconnect Applications

Authors: Maxim A. Stolyarov, Guanxiong Liu, Matthew A. Bloodgood, Ece Aytan, Chenglong Jiang, Rameez Samnakay, Tina T. Salguero, Denis L. Nika, Krassimir N. Bozhilov, Alexander A. Balandin

Abstract: We report results of investigation of the current-carrying capacity of nanowires made from the quasi-1D van der Waals metal tantalum triselenide capped with quasi-2D boron nitride. The chemical vapor transport method followed by chemical and mechanical exfoliation were used to fabricate mm-long TaSe3 wires with lateral dimensions in the 20 to 70 nm range. Electrical measurements establish that TaS… ▽ More We report results of investigation of the current-carrying capacity of nanowires made from the quasi-1D van der Waals metal tantalum triselenide capped with quasi-2D boron nitride. The chemical vapor transport method followed by chemical and mechanical exfoliation were used to fabricate mm-long TaSe3 wires with lateral dimensions in the 20 to 70 nm range. Electrical measurements establish that TaSe3/h-BN nanowire heterostructures have a breakdown current density exceeding 10 MA/cm2 - an order-of-magnitude higher than that in copper. Some devices exhibited an intriguing step-like breakdown, which can be explained by the atomic thread bundle structure of the nanowires. The quasi-1D single crystal nature of TaSe3 results in low surface roughness and the absence of grain boundaries; these features potentially can enable the downscaling of these wires to lateral dimensions in the few-nm range. These results suggest that quasi-1D van der Waals metals have potential for applications in the ultimately downscaled local interconnects. △ Less

Submitted 11 April, 2016; originally announced April 2016.

Comments: 22 pages, 6 figures

Journal ref: Nanoscale, 8, 15774 (2016)

arXiv:1602.07807 [pdf, other]

doi 10.1109/ICSC.2016.38

Data Cleaning for XML Electronic Dictionaries via Statistical Anomaly Detection

Authors: Michael Bloodgood, Benjamin Strauss

Abstract: Many important forms of data are stored digitally in XML format. Errors can occur in the textual content of the data in the fields of the XML. Fixing these errors manually is time-consuming and expensive, especially for large amounts of data. There is increasing interest in the research, development, and use of automated techniques for assisting with data cleaning. Electronic dictionaries are an i… ▽ More Many important forms of data are stored digitally in XML format. Errors can occur in the textual content of the data in the fields of the XML. Fixing these errors manually is time-consuming and expensive, especially for large amounts of data. There is increasing interest in the research, development, and use of automated techniques for assisting with data cleaning. Electronic dictionaries are an important form of data frequently stored in XML format that frequently have errors introduced through a mixture of manual typographical entry errors and optical character recognition errors. In this paper we describe methods for flagging statistical anomalies as likely errors in electronic dictionaries stored in XML format. We describe six systems based on different sources of information. The systems detect errors using various signals in the data including uncommon characters, text length, character-based language models, word-based language models, tied-field length ratios, and tied-field transliteration models. Four of the systems detect errors based on expectations automatically inferred from content within elements of a single field type. We call these single-field systems. Two of the systems detect errors based on correspondence expectations automatically inferred from content within elements of multiple related field types. We call these tied-field systems. For each system, we provide an intuitive analysis of the type of error that it is successful at detecting. Finally, we describe two larger-scale evaluations using crowdsourcing with Amazon's Mechanical Turk platform and using the annotations of a domain expert. The evaluations consistently show that the systems are useful for improving the efficiency with which errors in XML electronic dictionaries can be detected. △ Less

Submitted 11 April, 2016; v1 submitted 25 February, 2016; originally announced February 2016.

Comments: 8 pages, 4 figures, 5 tables; published in Proceedings of the 2016 IEEE Tenth International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, pages 79-86, February 2016

ACM Class: I.5.1; I.5.4; G.3; I.2.7; I.2.6

Journal ref: In Proceedings of the 2016 IEEE Tenth International Conference on Semantic Computing (ICSC), pages 79-86, Laguna Hills, CA, USA, February 2016. IEEE

arXiv:1505.05841 [pdf, other]

Translation Memory Retrieval Methods

Authors: Michael Bloodgood, Benjamin Strauss

Abstract: Translation Memory (TM) systems are one of the most widely used translation technologies. An important part of TM systems is the matching algorithm that determines what translations get retrieved from the bank of available translations to assist the human translator. Although detailed accounts of the matching algorithms used in commercial systems can't be found in the literature, it is widely beli… ▽ More Translation Memory (TM) systems are one of the most widely used translation technologies. An important part of TM systems is the matching algorithm that determines what translations get retrieved from the bank of available translations to assist the human translator. Although detailed accounts of the matching algorithms used in commercial systems can't be found in the literature, it is widely believed that edit distance algorithms are used. This paper investigates and evaluates the use of several matching algorithms, including the edit distance algorithm that is believed to be at the heart of most modern commercial TM systems. This paper presents results showing how well various matching algorithms correlate with human judgments of helpfulness (collected via crowdsourcing with Amazon's Mechanical Turk). A new algorithm based on weighted n-gram precision that can be adjusted for translator length preferences consistently returns translations judged to be most helpful by translators for multiple domains and language pairs. △ Less

Submitted 21 May, 2015; originally announced May 2015.

Comments: 9 pages, 6 tables, 3 figures; appeared in Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, April 2014

ACM Class: I.2.7

Journal ref: In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pages 202-210, Gothenburg, Sweden, April 2014. Association for Computational Linguistics

arXiv:1504.06329 [pdf, other]

Analysis of Stopping Active Learning based on Stabilizing Predictions

Authors: Michael Bloodgood, John Grothendieck

Abstract: Within the natural language processing (NLP) community, active learning has been widely investigated and applied in order to alleviate the annotation bottleneck faced by developers of new NLP systems and technologies. This paper presents the first theoretical analysis of stopping active learning based on stabilizing predictions (SP). The analysis has revealed three elements that are central to the… ▽ More Within the natural language processing (NLP) community, active learning has been widely investigated and applied in order to alleviate the annotation bottleneck faced by developers of new NLP systems and technologies. This paper presents the first theoretical analysis of stopping active learning based on stabilizing predictions (SP). The analysis has revealed three elements that are central to the success of the SP method: (1) bounds on Cohen's Kappa agreement between successively trained models impose bounds on differences in F-measure performance of the models; (2) since the stop set does not have to be labeled, it can be made large in practice, helping to guarantee that the results transfer to previously unseen streams of examples at test/application time; and (3) good (low variance) sample estimates of Kappa between successive models can be obtained. Proofs of relationships between the level of Kappa agreement and the difference in performance between consecutive models are presented. Specifically, if the Kappa agreement between two models exceeds a threshold T (where $T>0$), then the difference in F-measure performance between those models is bounded above by $\frac{4(1-T)}{T}$ in all cases. If precision of the positive conjunction of the models is assumed to be $p$, then the bound can be tightened to $\frac{4(1-T)}{(p+1)T}$. △ Less

Submitted 23 April, 2015; originally announced April 2015.

Comments: 10 pages, 8 tables; appeared in Proceedings of the Seventeenth Conference on Computational Natural Language Learning, August 2013

ACM Class: I.5.1; I.5.4; G.3; I.2.7; I.2.6

Journal ref: In Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pages 10-19, Sofia, Bulgaria, August 2013. Association for Computational Linguistics

arXiv:1503.01190 [pdf, other]

Statistical modality tagging from rule-based annotations and crowdsourcing

Authors: Vinodkumar Prabhakaran, Michael Bloodgood, Mona Diab, Bonnie Dorr, Lori Levin, Christine D. Piatko, Owen Rambow, Benjamin Van Durme

Abstract: We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatic… ▽ More We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatically training a modality tagger where we first gathered sentences based on a high-recall simple rule-based modality tagger and then provided these sentences to Mechanical Turk annotators for further annotation. We used the resulting set of training data to train a precise modality tagger using a multi-class SVM that delivers good performance. △ Less

Submitted 3 March, 2015; originally announced March 2015.

Comments: 8 pages, 6 tables; appeared in Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, July 2012; In Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, pages 57-64, Jeju, Republic of Korea, July 2012. Association for Computational Linguistics

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: In Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, pages 57-64, Jeju, Republic of Korea, July 2012. Association for Computational Linguistics

arXiv:1502.01682 [pdf, other]

Use of Modality and Negation in Semantically-Informed Syntactic MT

Authors: Kathryn Baker, Michael Bloodgood, Bonnie J. Dorr, Chris Callison-Burch, Nathaniel W. Filardo, Christine Piatko, Lori Levin, Scott Miller

Abstract: This paper describes the resource- and system-building efforts of an eight-week Johns Hopkins University Human Language Technology Center of Excellence Summer Camp for Applied Language Exploration (SCALE-2009) on Semantically-Informed Machine Translation (SIMT). We describe a new modality/negation (MN) annotation scheme, the creation of a (publicly available) MN lexicon, and two automated MN tagge… ▽ More This paper describes the resource- and system-building efforts of an eight-week Johns Hopkins University Human Language Technology Center of Excellence Summer Camp for Applied Language Exploration (SCALE-2009) on Semantically-Informed Machine Translation (SIMT). We describe a new modality/negation (MN) annotation scheme, the creation of a (publicly available) MN lexicon, and two automated MN taggers that we built using the annotation scheme and lexicon. Our annotation scheme isolates three components of modality and negation: a trigger (a word that conveys modality or negation), a target (an action associated with modality or negation) and a holder (an experiencer of modality). We describe how our MN lexicon was semi-automatically produced and we demonstrate that a structure-based MN tagger results in precision around 86% (depending on genre) for tagging of a standard LDC data set. We apply our MN annotation scheme to statistical machine translation using a syntactic framework that supports the inclusion of semantic annotations. Syntactic tags enriched with semantic annotations are assigned to parse trees in the target-language training texts through a process of tree grafting. While the focus of our work is modality and negation, the tree grafting procedure is general and supports other types of semantic information. We exploit this capability by including named entities, produced by a pre-existing tagger, in addition to the MN elements produced by the taggers described in this paper. The resulting system significantly outperformed a linguistically naive baseline model (Hiero), and reached the highest scores yet reported on the NIST 2009 Urdu-English test set. This finding supports the hypothesis that both syntactic and semantic information can improve translation quality. △ Less

Submitted 5 February, 2015; originally announced February 2015.

Comments: 28 pages, 13 figures, 2 tables; appeared in Computational Linguistics, 38(2):411-438, 2012

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: Computational Linguistics, 38(2):411-438, 2012

arXiv:1501.03191 [pdf, other]

Annotating Cognates and Etymological Origin in Turkic Languages

Authors: Benjamin S. Mericli, Michael Bloodgood

Abstract: Turkic languages exhibit extensive and diverse etymological relationships among lexical items. These relationships make the Turkic languages promising for exploring automated translation lexicon induction by leveraging cognate and other etymological information. However, due to the extent and diversity of the types of relationships between words, it is not clear how to annotate such information. I… ▽ More Turkic languages exhibit extensive and diverse etymological relationships among lexical items. These relationships make the Turkic languages promising for exploring automated translation lexicon induction by leveraging cognate and other etymological information. However, due to the extent and diversity of the types of relationships between words, it is not clear how to annotate such information. In this paper, we present a methodology for annotating cognates and etymological origin in Turkic languages. Our method strives to balance the amount of research effort the annotator expends with the utility of the annotations for supporting research on improving automated translation lexicon induction. △ Less

Submitted 13 January, 2015; originally announced January 2015.

Comments: 5 pages, 8 tables; appeared in Proceedings of the First Workshop on Language Resources and Technologies for Turkic Languages at the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 47-51, Istanbul, Turkey, May 2012. European Language Resources Association

ACM Class: I.2.7

Journal ref: In Proceedings of the First Workshop on Language Resources and Technologies for Turkic Languages at LREC'12, pages 47-51, Istanbul, Turkey, May 2012. European Language Resources Association

arXiv:1411.0007 [pdf]

Rapid Adaptation of POS Tagging for Domain Specific Uses

Authors: John E. Miller, Michael Bloodgood, Manabu Torii, K. Vijay-Shanker

Abstract: Part-of-speech (POS) tagging is a fundamental component for performing natural language tasks such as parsing, information extraction, and question answering. When POS taggers are trained in one domain and applied in significantly different domains, their performance can degrade dramatically. We present a methodology for rapid adaptation of POS taggers to new domains. Our technique is unsupervised… ▽ More Part-of-speech (POS) tagging is a fundamental component for performing natural language tasks such as parsing, information extraction, and question answering. When POS taggers are trained in one domain and applied in significantly different domains, their performance can degrade dramatically. We present a methodology for rapid adaptation of POS taggers to new domains. Our technique is unsupervised in that a manually annotated corpus for the new domain is not necessary. We use suffix information gathered from large amounts of raw text as well as orthographic information to increase the lexical coverage. We present an experiment in the Biological domain where our POS tagger achieves results comparable to POS taggers specifically trained to this domain. △ Less

Submitted 31 October, 2014; originally announced November 2014.

Comments: 2 pages, 2 tables; appeared in Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology, June 2006

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: In Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology, pages 118-119, New York, New York, June 2006. Association for Computational Linguistics

arXiv:1410.8553 [pdf, other]

A random forest system combination approach for error detection in digital dictionaries

Authors: Michael Bloodgood, Peng Ye, Paul Rodrigues, David Zajic, David Doermann

Abstract: When digitizing a print bilingual dictionary, whether via optical character recognition or manual entry, it is inevitable that errors are introduced into the electronic version that is created. We investigate automating the process of detecting errors in an XML representation of a digitized print dictionary using a hybrid approach that combines rule-based, feature-based, and language model-based m… ▽ More When digitizing a print bilingual dictionary, whether via optical character recognition or manual entry, it is inevitable that errors are introduced into the electronic version that is created. We investigate automating the process of detecting errors in an XML representation of a digitized print dictionary using a hybrid approach that combines rule-based, feature-based, and language model-based methods. We investigate combining methods and show that using random forests is a promising approach. We find that in isolation, unsupervised methods rival the performance of supervised methods. Random forests typically require training data so we investigate how we can apply random forests to combine individual base methods that are themselves unsupervised without requiring large amounts of training data. Experiments reveal empirically that a relatively small amount of data is sufficient and can potentially be further reduced through specific selection criteria. △ Less

Submitted 30 October, 2014; originally announced October 2014.

Comments: 9 pages, 7 figures, 10 tables; appeared in Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, April 2012

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, pages 78-86, Avignon, France, April 2012. Association for Computational Linguistics

arXiv:1410.8149 [pdf]

Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling

Authors: Paul Rodrigues, David Zajic, David Doermann, Michael Bloodgood, Peng Ye

Abstract: Dictionaries are often developed using tools that save to Extensible Markup Language (XML)-based standards. These standards often allow high-level repeating elements to represent lexical entries, and utilize descendants of these repeating elements to represent the structure within each lexical entry, in the form of an XML tree. In many cases, dictionaries are published that have errors and inconsi… ▽ More Dictionaries are often developed using tools that save to Extensible Markup Language (XML)-based standards. These standards often allow high-level repeating elements to represent lexical entries, and utilize descendants of these repeating elements to represent the structure within each lexical entry, in the form of an XML tree. In many cases, dictionaries are published that have errors and inconsistencies that are expensive to find manually. This paper discusses a method for dictionary writers to quickly audit structural regularity across entries in a dictionary by using statistical language modeling. The approach learns the patterns of XML nodes that could occur within an XML tree, and then calculates the probability of each XML tree in the dictionary against these patterns to look for entries that diverge from the norm. △ Less

Submitted 29 October, 2014; originally announced October 2014.

Comments: 6 pages, 2 figures, 11 tables; appeared in Proceedings of Electronic Lexicography in the 21st Century (eLex), November 2011

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: In Proceedings of Electronic Lexicography in the 21st Century (eLex), pages 227-232, Bled, Slovenia, November 2011. Trojina Institute for Applied Slovene Studies

arXiv:1410.7787 [pdf]

Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language

Authors: David Zajic, Michael Maxwell, David Doermann, Paul Rodrigues, Michael Bloodgood

Abstract: We describe a paradigm for combining manual and automatic error correction of noisy structured lexicographic data. Modifications to the structure and underlying text of the lexicographic data are expressed in a simple, interpreted programming language. Dictionary Manipulation Language (DML) commands identify nodes by unique identifiers, and manipulations are performed using simple commands such as… ▽ More We describe a paradigm for combining manual and automatic error correction of noisy structured lexicographic data. Modifications to the structure and underlying text of the lexicographic data are expressed in a simple, interpreted programming language. Dictionary Manipulation Language (DML) commands identify nodes by unique identifiers, and manipulations are performed using simple commands such as create, move, set text, etc. Corrected lexicons are produced by applying sequences of DML commands to the source version of the lexicon. DML commands can be written manually to repair one-off errors or generated automatically to correct recurring problems. We discuss advantages of the paradigm for the task of editing digital bilingual dictionaries. △ Less

Submitted 28 October, 2014; originally announced October 2014.

Comments: 5 pages, 3 figures, 1 table; appeared in Proceedings of Electronic Lexicography in the 21st Century (eLex), November 2011

Journal ref: In Proceedings of Electronic Lexicography in the 21st Century (eLex), pages 297-301, Bled, Slovenia, November 2011. Trojina Institute for Applied Slovene Studies

arXiv:1410.5877 [pdf, other]

Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation

Authors: Michael Bloodgood, Chris Callison-Burch

Abstract: We explore how to improve machine translation systems by adding more translation data in situations where we already have substantial resources. The main challenge is how to buck the trend of diminishing returns that is commonly encountered. We present an active learning-style data solicitation algorithm to meet this challenge. We test it, gathering annotations via Amazon Mechanical Turk, and find… ▽ More We explore how to improve machine translation systems by adding more translation data in situations where we already have substantial resources. The main challenge is how to buck the trend of diminishing returns that is commonly encountered. We present an active learning-style data solicitation algorithm to meet this challenge. We test it, gathering annotations via Amazon Mechanical Turk, and find that we get an order of magnitude increase in performance rates of improvement. △ Less

Submitted 21 October, 2014; originally announced October 2014.

Comments: 11 pages, 14 figures; appeared in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, July 2010

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 854-864, Uppsala, Sweden, July 2010. Association for Computational Linguistics

arXiv:1410.5491 [pdf, other]

Using Mechanical Turk to Build Machine Translation Evaluation Sets

Authors: Michael Bloodgood, Chris Callison-Burch

Abstract: Building machine translation (MT) test sets is a relatively expensive task. As MT becomes increasingly desired for more and more language pairs and more and more domains, it becomes necessary to build test sets for each case. In this paper, we investigate using Amazon's Mechanical Turk (MTurk) to make MT test sets cheaply. We find that MTurk can be used to make test sets much cheaper than professi… ▽ More Building machine translation (MT) test sets is a relatively expensive task. As MT becomes increasingly desired for more and more language pairs and more and more domains, it becomes necessary to build test sets for each case. In this paper, we investigate using Amazon's Mechanical Turk (MTurk) to make MT test sets cheaply. We find that MTurk can be used to make test sets much cheaper than professionally-produced test sets. More importantly, in experiments with multiple MT systems, we find that the MTurk-produced test sets yield essentially the same conclusions regarding system performance as the professionally-produced test sets yield. △ Less

Submitted 20 October, 2014; originally announced October 2014.

Comments: 4 pages, 2 tables; appeared in Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, June 2010

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, pages 208-211, Los Angeles, California, June 2010. Association for Computational Linguistics

arXiv:1410.4868 [pdf, other]

A Modality Lexicon and its use in Automatic Tagging

Authors: Kathryn Baker, Michael Bloodgood, Bonnie J. Dorr, Nathaniel W. Filardo, Lori Levin, Christine Piatko

Abstract: This paper describes our resource-building results for an eight-week JHU Human Language Technology Center of Excellence Summer Camp for Applied Language Exploration (SCALE-2009) on Semantically-Informed Machine Translation. Specifically, we describe the construction of a modality annotation scheme, a modality lexicon, and two automated modality taggers that were built using the lexicon and annotat… ▽ More This paper describes our resource-building results for an eight-week JHU Human Language Technology Center of Excellence Summer Camp for Applied Language Exploration (SCALE-2009) on Semantically-Informed Machine Translation. Specifically, we describe the construction of a modality annotation scheme, a modality lexicon, and two automated modality taggers that were built using the lexicon and annotation scheme. Our annotation scheme is based on identifying three components of modality: a trigger, a target and a holder. We describe how our modality lexicon was produced semi-automatically, expanding from an initial hand-selected list of modality trigger words and phrases. The resulting expanded modality lexicon is being made publicly available. We demonstrate that one tagger---a structure-based tagger---results in precision around 86% (depending on genre) for tagging of a standard LDC data set. In a machine translation application, using the structure-based tagger to annotate English modalities on an English-Urdu training corpus improved the translation quality score for Urdu by 0.3 Bleu points in the face of sparse training data. △ Less

Submitted 17 October, 2014; originally announced October 2014.

Comments: 6 pages, 5 figures; appeared in Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), May 2010

ACM Class: I.2.7

Journal ref: In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), pages 1402-1407, Valletta, Malta, May 2010. European Language Resources Association

arXiv:1409.7085 [pdf, other]

Semantically-Informed Syntactic Machine Translation: A Tree-Grafting Approach

Authors: Kathryn Baker, Michael Bloodgood, Chris Callison-Burch, Bonnie J. Dorr, Nathaniel W. Filardo, Lori Levin, Scott Miller, Christine Piatko

Abstract: We describe a unified and coherent syntactic framework for supporting a semantically-informed syntactic approach to statistical machine translation. Semantically enriched syntactic tags assigned to the target-language training texts improved translation quality. The resulting system significantly outperformed a linguistically naive baseline model (Hiero), and reached the highest scores yet reporte… ▽ More We describe a unified and coherent syntactic framework for supporting a semantically-informed syntactic approach to statistical machine translation. Semantically enriched syntactic tags assigned to the target-language training texts improved translation quality. The resulting system significantly outperformed a linguistically naive baseline model (Hiero), and reached the highest scores yet reported on the NIST 2009 Urdu-English translation task. This finding supports the hypothesis (posed by many researchers in the MT community, e.g., in DARPA GALE) that both syntactic and semantic information are critical for improving translation quality---and further demonstrates that large gains can be achieved for low-resource languages with different word order than English. △ Less

Submitted 24 September, 2014; originally announced September 2014.

Comments: 10 pages, 7 figures, 3 tables; appeared in Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas (AMTA), October 2010

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: In Proceedings of the Ninth Conference of the Association for Machine Translation in the Americas (AMTA), Denver, Colorado, October 2010

arXiv:1409.5165 [pdf, other]

A Method for Stopping Active Learning Based on Stabilizing Predictions and the Need for User-Adjustable Stopping

Authors: Michael Bloodgood, K. Vijay-Shanker

Abstract: A survey of existing methods for stopping active learning (AL) reveals the needs for methods that are: more widely applicable; more aggressive in saving annotations; and more stable across changing datasets. A new method for stopping AL based on stabilizing predictions is presented that addresses these needs. Furthermore, stopping methods are required to handle a broad range of different annotatio… ▽ More A survey of existing methods for stopping active learning (AL) reveals the needs for methods that are: more widely applicable; more aggressive in saving annotations; and more stable across changing datasets. A new method for stopping AL based on stabilizing predictions is presented that addresses these needs. Furthermore, stopping methods are required to handle a broad range of different annotation/performance tradeoff valuations. Despite this, the existing body of work is dominated by conservative methods with little (if any) attention paid to providing users with control over the behavior of stopping methods. The proposed method is shown to fill a gap in the level of aggressiveness available for stopping AL and supports providing users with control over stopping behavior. △ Less

Submitted 17 September, 2014; originally announced September 2014.

Comments: 9 pages, 3 figures, 5 tables; appeared in Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL-2009), June 2009

ACM Class: I.2.6; I.2.7; I.5.1; I.5.4; G.3

Journal ref: In Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL-2009), pages 39-47, Boulder, Colorado, June 2009. Association for Computational Linguistics

arXiv:1409.4835 [pdf, ps, other]

Taking into Account the Differences between Actively and Passively Acquired Data: The Case of Active Learning with Support Vector Machines for Imbalanced Datasets

Authors: Michael Bloodgood, K. Vijay-Shanker

Abstract: Actively sampled data can have very different characteristics than passively sampled data. Therefore, it's promising to investigate using different inference procedures during AL than are used during passive learning (PL). This general idea is explored in detail for the focused case of AL with cost-weighted SVMs for imbalanced data, a situation that arises for many HLT tasks. The key idea behind t… ▽ More Actively sampled data can have very different characteristics than passively sampled data. Therefore, it's promising to investigate using different inference procedures during AL than are used during passive learning (PL). This general idea is explored in detail for the focused case of AL with cost-weighted SVMs for imbalanced data, a situation that arises for many HLT tasks. The key idea behind the proposed InitPA method for addressing imbalance is to base cost models during AL on an estimate of overall corpus imbalance computed via a small unbiased sample rather than the imbalance in the labeled training data, which is the leading method used during PL. △ Less

Submitted 16 September, 2014; originally announced September 2014.

Comments: 4 pages, 5 figures; appeared in Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pages 137-140, Boulder, Colorado, June 2009. Association for Computational Linguistics

ACM Class: I.2.6; I.2.7; I.5.1; I.5.4

Journal ref: Proceedings of HLT: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Short Papers, pages 137-140, Boulder, Colorado, June 2009. Association for Computational Linguistics

arXiv:1409.3881 [pdf]

An Approach to Reducing Annotation Costs for BioNLP

Authors: Michael Bloodgood, K. Vijay-Shanker

Abstract: There is a broad range of BioNLP tasks for which active learning (AL) can significantly reduce annotation costs and a specific AL algorithm we have developed is particularly effective in reducing annotation costs for these tasks. We have previously developed an AL algorithm called ClosestInitPA that works best with tasks that have the following characteristics: redundancy in training material, bur… ▽ More There is a broad range of BioNLP tasks for which active learning (AL) can significantly reduce annotation costs and a specific AL algorithm we have developed is particularly effective in reducing annotation costs for these tasks. We have previously developed an AL algorithm called ClosestInitPA that works best with tasks that have the following characteristics: redundancy in training material, burdensome annotation costs, Support Vector Machines (SVMs) work well for the task, and imbalanced datasets (i.e. when set up as a binary classification problem, one class is substantially rarer than the other). Many BioNLP tasks have these characteristics and thus our AL algorithm is a natural approach to apply to BioNLP tasks. △ Less

Submitted 12 September, 2014; originally announced September 2014.

Comments: 2 pages, 1 figure, 5 tables; appeared in Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing at ACL (Association for Computational Linguistics) 2008

ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

Journal ref: In Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, pages 104-105, Columbus, Ohio, June 2008. Association for Computational Linguistics

Showing 1–34 of 34 results for author: Bloodgood, M