Joseph Turian, Benjamin Wellington, and I. Dan Melamed (2006). Scalable Discriminative Learning for Natural Language Parsing and Translation, [PS] [PDF] Proceedings of the 20th Annual Conference on Neural Information Processing Systems (NIPS), Vancouver, BC.
Benjamin Wellington, Joseph Turian, Chris Pike, and I. Dan Melamed (2006). Scalable Purely-Discriminative Training for Word and Tree Transducers, [PS] [PDF] Proceedings of the 7th Biennial Conference of the Association for Machine Translation in the Americas (AMTA), Boston, MA.
Benjamin Wellington, Sonjia Waxmonsky, and I. Dan Melamed (2006). Empirical Lower Bounds on the Complexity of Translational Equivalence, [PS] [PDF] Proceedings of the 44th annual conference of the Association for Computational Linguistics (ACL), Sydney, Australia.
Joseph Turian and I. Dan Melamed (2006). Advances in Discriminative Parsing, [PS] [PDF] Proceedings of the 44th annual conference of the Association for Computational Linguistics (ACL), Sydney, Australia.
Joseph Turian and I. Dan Melamed (2006). Computational Challenges in Parsing by Classification, [PS] [PDF] HLT-NAACL Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing, New York, NY.
Christo Kirov, Esther Iserovich, Brad Wellington, and I. Dan Melamed (2006). Plagiarism Detection in Prose and in Programs, NYU Undergraduate Research Conference, April. Also available as Proteus Project TR 06-012.
Sonjia Waxmonsky and I. Dan Melamed (2006). A Dynamic Data Structure for Parsing with Discontinuous Constituents [PDF], Proteus Project TR 06-001, January.
A. Burbank, M. Carpuat, S. Clark, M. Dreyer, P. Fox, D. Groves, K. Hall, M. Hearne, I. D. Melamed, Y. Shen, A. Way, B. Wellington, and D. Wu (2005). Final Report of the 2005 Language Engineering Workshop on Statistical Machine Translation by Parsing, http://www.clsp.jhu.edu/ws2005/groups/statistical/documents/finalreport.pdf
Joseph Turian and I. Dan Melamed (2005). Constituent Parsing by Classification [PS] [PDF], Proceedings of the 9th International Workshop on Parsing Technologies (IWPT), Vancouver, BC.
I. Dan Melamed (2004). Algorithms for Syntax-Aware Statistical Machine Translation [PS] [PDF] (slides from the invited talk: [PDF]), Proceedings of the Conference on Theoretical and Methodological Issues in Machine Translation (TMI), Baltimore, MD.
I. Dan Melamed (2004). Statistical Machine Translation by Parsing [PS] [PDF] Proceedings of the 42nd Annual Conference of the Association for Computational Linguistics (ACL), Barcelona, Spain. This version has a couple of typos corrected. A longer (easier to understand) version is here: [PS] [PDF].
I. Dan Melamed, Giorgio Satta, and Ben Wellington (2004). Generalized Multitext Grammars [PS][PDF] Proceedings of the 42nd Annual Conference of the Association for Computational Linguistics (ACL), Barcelona, Spain. This version has a couple of typos corrected.
Chris Pike and I. Dan Melamed (2004). An Automatic Filter for Non-Parallel Texts [PS] [PDF] Proceedings of the 42nd Annual Conference of the Association for Computational Linguistics (ACL) (companion volume), Barcelona, Spain.
Ali Argyle, Darren Jahnel, Jon Liebowitz, Sachiko Omatoi, Jeremy Shapiro, Graig Warner, and I. Dan Melamed (2004). A Bitext Harvesting and Distribution System [doc] Proteus Project Technical Report #04-001.
Joseph P. Turian, Luke Shen, and I. Dan Melamed (2003). Evaluation of Machine Translation and its Evaluation [PS] [PDF], Proteus technical report #03-005, a revised version of the paper presented at MT Summit IX, New Orleans, LA.
I. Dan Melamed, Ryan Green and Joseph P. Turian (2003). Precision and Recall of Machine Translation [PS] [PDF], Proteus technical report #03-004, a revised version of the paper presented at NAACL/HLT 2003, Edmonton, AB.
I. Dan Melamed (2003). Multitext Grammars and Synchronous Parsers [PS] [PDF], Proceedings of NAACL/HLT. (This version has a couple of typos corrected.)
I. Dan Melamed (2001). Empirical Methods for Exploiting Parallel Texts, MIT Press.
Mosleh H. Al-Adhaileh, Tang Enya Kong, and I. Dan Melamed (2001). Malay-English Bitext Mapping and Alignment Using SIMR/GSA Algorithms [PDF] [DOC], Malaysian National Conference on Research and Development in Computer Science (REDECS'01), Selangor Darul Ehsan, Malaysia. Mosleh's beautiful PPT slides are here.
I. Dan Melamed (2000). Models of Translational Equivalence among Words, Computational Linguistics 26(2), 221-249, June.
I. Dan Melamed, Isabelle Moulinier, Elie Naulleau, and Changwen Yang (2000). Les bases de la recherche d'information en langue française à West Group, 2nd Amaryllis conference on French information retrieval, Paris, France.
I. Dan Melamed and Philip Resnik (2000). Tagger Evaluation Given Hierarchical Tag Sets, Computers and the Humanities 34(1-2). Special issue on SENSEVAL.
Yaser Al-Onaizan, Jan Curin, Michael Jahr, Kevin Knight, John Lafferty, Dan Melamed, Franz-Josef Och, David Purdy, Noah A. Smith and David Yarowsky (1999). Statistical Machine Translation: Final Report, Johns Hopkins University 1999 Summer Workshop on Language Engineering, Center for Speech and Language Processing, Baltimore, MD. http://www.clsp.jhu.edu/ws99/projects/mt/final_report/mt-final-report.ps
I. Dan Melamed and Hang Li (1999). Review of H. Schütze's Ambiguity Resolution in Language Learning, in Computational Linguistics 25(3).
I. Dan Melamed (1999). Bitext Maps and Alignment via Pattern Recognition, Computational Linguistics 25(1), 107-130, March.
I. Dan Melamed (1999). Module-Level Description of the RWIN French IR System, West Group internal report, March.
I. Dan Melamed (1998). Technological Challenges in non-English Information, West Group internal report.
I. Dan Melamed (1998). Empirical Methods for MT Lexicon Construction, in L. Gerber and D. Farwell, Eds., Machine Translation and the Information Soup, Springer-Verlag.
I. Dan Melamed and Mitchell P. Marcus (1998). Automatic Construction of Chinese-English Translation Lexicons, IRCS Technical Report #98-28.
I. Dan Melamed (1998). Manual Annotation of Translational Equivalence: The Blinker Project, IRCS Technical Report #98-07.
I. Dan Melamed (1998). Annotation Style Guide for the Blinker Project, IRCS Technical Report #98-06.
I. Dan Melamed (1998). Models of Co-occurrence, IRCS Technical Report #98-05.
David Graff, I. Dan Melamed, and Paul Morgovsky (1997). Hansard Corpus: Parallel Text in English and French, on CD-ROM, Linguistic Data Consortium. N.B.: The set-b documentation on the CD contained a couple of errors. The revised version is here.
I. Dan Melamed (1997). Automatic Discovery of Non-Compositional Compounds in Parallel Data, 2nd Conference on Empirical Methods in Natural Language Processing (EMNLP), Providence, RI.
I. Dan Melamed (1997). A Word-to-Word Model of Translational Equivalence, 35th Conference of the Association for Computational Linguistics (ACL), Madrid, Spain. (This version has some typos corrected.)
I. Dan Melamed (1997). A Portable Algorithm for Mapping Bitext Correspondence, 35th Conference of the Association for Computational Linguistics (ACL), Madrid, Spain.
I. Dan Melamed (1997). Measuring Semantic Entropy, SIGLEX Workshop on Tagging Text with Lexical Semantics, Washington, DC.
Philip Resnik and I. Dan Melamed (1997). Semi-Automatic Acquisition of Domain-Specific Translation Lexicons, Fifth Conference on Applied Natural Language Processing, Washington, DC.
I. Dan Melamed (1997). A Scalable Architecture for Bilingual Lexicography, Dept. of Computer and Information Science Technical Report #MS-CIS-97-01.
I. Dan Melamed (1996). Automatic Construction of Clean Broad-Coverage Translation Lexicons, 2nd Conference of the Association for Machine Translation in the Americas (AMTA), Montreal, PQ.
I. Dan Melamed (1996). Porting SIMR to New Language Pairs, IRCS Technical Report #96-26.
I. Dan Melamed (1996). Automatic Detection of Omissions in Translations, IRCS Technical Report #96-23, a revised version of the paper presented at the 16th International Conference on Computational Linguistics (COLING), Copenhagen, Denmark.
I. Dan Melamed (1996). A Geometric Approach to Mapping Bitext Correspondence, IRCS Technical Report #96-22, a revised version of the paper presented at the First Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, PA, May.
I. Dan Melamed (1995). Automatic Evaluation and Uniform Filter Cascades for Inducing N-best Translation Lexicons, Third Workshop on Very Large Corpora (WVLC3), Boston, MA.