Spinnakr Active Analytics

2014

Montserrat Marimon; Núria Bel; Lluís Padró (2014) Automatic Selection of HPSG-Parsed Sentences for Treebank Construction, Computational Linguistics, vol 40, n 3, pp 523-531

Borja Balle, Xavier Carreras, Franco M. Luque, Ariadna Quattoni (2014) Spectral Learning of Weighted Automata: A Forward-Backward Perspective. Machine Learning, Special Issue on Grammatical Inference, vol 96, n 1-2, pp 33-63

Marina Lloberes; Irene Castellón; Lluís Padró; Edgar Gonzàlez (2014) ParTes. Test Suite for Parsing Evaluation. Procesamiento del Lenguaje Natural, n 53, pp 87-94

Rusu, D., Fortuna, B. and Mladenić, D. (2014) Measuring Concept Similarity in Ontologies using Weighted Concept Paths. Applied Ontology 9, no 1, pp 65-95

Evgenia Belyaeva, Aljaž Košmerlj, Andrej Muhič, Jan Rupnik, Flavio Fuart (2015) Using Semantic Data to Improve Linking of Cross-Lingual Clusters, Journal of Web Semantics, Special issue on machine learning and data mining for the Semantic Web

Lei Zhang, Achim Rettinger, Semantic Annotation, Analysis and Comparison: A Multilingual and Cross-lingual Text Analytics Toolkit, Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics, ACL, Gothenburg, Sweden, 2014-04.

Xavier Carreras, Lluís Padró, Lei Zhang, Achim Rettinger, Zhixing Li, Esteban García-Cuesta, Željko Agić, Božo Bekavac, Blaz Fortuna and Tadej Štajner, XLike Project Language Analysis Services, Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics, ACL, Gothenburg, Sweden, 2014-04.

Gregor Leban, Janez Brank, Marko Grobelnik, Blaž Fortuna, Event Registry – Learning About World Events From News, WWW2014, Seoul, Korea, 2014-04 (http://www2014.kr/wp-content/uploads/2014/05/companion_p107.pdf)

Krešimir Šojat, Matea Srebačić, Marko Tadić, Tin Pavelić, CroDeriV: a New Resource for Processing Croatian Morphology, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC2014), ELRA, Reykjavik-Paris, 2014-05.

Lluís Padró, Željko Agić, Xavier Carreras, Blaž Fortuna, Esteban García-Cuesta, Zhixing Li, Tadej Štajner, Marko Tadić, Language Processing Infrastructure in the XLike Project, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC2014), ELRA, Reykjavik-Paris, 2014-05.

Željko Agić, Daša Berović, Danijela Merkler, Marko Tadić, Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC2014), ELRA, Reykjavik-Paris, 2014-05.

Achim Rettinger, Lei Zhang, Daša Berović, Danijela Merkler, Matea Srebačić, Marko Tadić, RECSA: Resource for Evaluating Cross-lingual Semantic Annotation, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC2014), ELRA, Reykjavik-Paris, 2014-05.

Iñaki Alegria, Nora Aranberri, Pere Comas, Victor Fresno, Pablo Gamallo, Lluís Padró, Iñaki San Vicente, Jordi Turmo and Arkaitz Zubiagam TweetNorm_es: an Annotated Corpus for Spanish Microtext Normalization, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC2014), ELRA, Reykjavik-Paris, 2014-05.

Marko Tadić, XLike: Cross-lingual Knowledge Extraction, Proceedings of the Seventeenth Annual Conference of the European Association for Machine Translation, EAMT, Dubrovnik, Croatia, 2014-06.

Ariadna Quattoni, Borja Balle, Xavier Carreras, Amir Globerson, Spectral Regularization for Max-Margin Sequence Tagging, JMLR Workshop and Conference Proceedings; Volume 32: Proceedings of The 31st International Conference on Machine Learning, Beijing, China, 2014-06.

Juanzi Li, Cross-lingual Knowledge Validation Based Taxonomy Derivation from Heterogeneous Online Wikis, Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, Quebec, Canada, 2014-07, pp 180-186 (http://www.aaai.org/ocs/index.php/AAAI/AAAI14/paper/viewFile/8260/8418)

Pranava S. Madhyastha, Xavier Carreras, Ariadna Quattoni, Learning Task-specific Bilexical Embeddings, Proceedings of COLING2014, Dublin, Ireland, 2014-08.

Aljaž Košmerlj, Jenya Belyaeva, Gregor Leban, Blaž Fortuna, Marko Grobelnik, Crowdsourcing Event Extraction, Proceedings of Data Science for News Publishing Workshop (NewsKDD2014), New York, USA, 2014-08 (http://ailab.ijs.si/~blazf/NewsKDD2014/submissions/newskdd2014_submission_8. pdf)

Lei Zhang, Achim Rettinger, X-LiSA: Cross-lingual Semantic Annotation, Proceedings of the VLDB Endowment (PVLDB), The 40th International Conference on Very Large Data Bases (VLDB), Hangzhou, China, 2014-09

Krešimir Šojat, Matea Srebačić, Tin Pavelić, CroDeriV 2.0: Initial Experiments, In: Przepiórkowski, Adam; Ogrodniczuk, Maciej (eds.) Advances in Natural Language Processing, LNCS8686, Springer, Heidelberg, 2014.

Janez Brank, Gregor Leban, Marko Grobelnik, A High-Performance Multithreaded Approach for Clustering a Stream of Documents, Proceedings of the SiKDD2014 conference, Ljubljana, Slovenia, 2014-10 (http://ailab.ijs.si/dunja/SiKDD2014/Papers/Brank_Clustering.pdf)

Gregor Leban, Janez Brank, Marko Grobelnik, Blaž Fortuna, Cross-lingual detection of world events from news articles, Proceedings of the ISWC2014 Posters & Demonstrations Track within the 13th International Semantic Web Conference (ISWC2014), Riva del Garda, Italy, 2014-10 (http://ceur-ws.org/Vol-1272/paper_19.pdf)

Lei Zhang, Michael Färber, Thanh Tran, Achim Rettinger, Exploiting Semantic Annotations for Entity-based Information Retrieval, Proceedings of the ISWC2014 Posters & Demonstrations Track within the 13th International Semantic Web Conference (ISWC2014), Riva del Garda, Italy, 2014-10 (http://ceur-ws.org/Vol-1272/paper_134.pdf)

Lei Zhang, Achim Rettinger, Steffen Thoma, Bridging the Gap between Cross-lingual NLP and DBpedia by Exploiting Wikipedia, Proceedings of the NLP&DBpedia Workshop within the 13th International Semantic Web Conference (ISWC2014), Riva del Garda, Italy, 2014-10 (https://nlpdbpedia2014.wordpress.com/programme)

Lluís, Xavier; Carreras, Xavier; Màrquez, Lluís, A Shortest-path Method for Arc-factored Semantic Role Labeling, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP2014), ACL, Doha, Qatar, 2014-10 (http://emnlp2014.org/papers/emnlp2014-proceedings.pdf)

Agić, Željko; Tiedemann, Jörg; Dobrovoljc, Kaja; Krek, Simon; Merkler, Danijela; Može, Sara, Cross-Lingual Dependency Parsing of Related Languages with Rich Morphosyntactic Tagsets, Proceedings of the EMNLP2014 Workshop on Language Technology for Closely Related Languages and Language Variants, ACL, Doha, Qatar, 2014-10 (http://bib.irb.hr/datoteka/716682.agic2014-cross.pdf)

2013

Grobelnik, Marko; Bradeško, Luka; Tadić, Marko, XLike project overview, FASSBL9 Proceedings USB, HDJT, 10/7/2013, Late Summer School Methods for the Linguists of the Future / Formal Approaches to South Slavic and Balkan Languages (FASSBL9), Dubrovnik, Croatia

Jan Rupnik, Andrej Muhic, Blaz Fortuna, Janez Starc, Marko Grobelnik, Michael J Witbrock, Cross-Lingual Technologies: Text to Logic Mapping, Search and Classification over 100 Languages, Cross-Lingual Technologies: Text to Logic Mapping, Search and Classification over 100 Languages, 12/7/2013, NIPS 2013, Lake Tahoe, Nevada, US

Zhigang Wang, Juanzi Li, Zhichun Wang, Shuangjie Li, Mingyang Li, Dongsheng Zhang, Yao Shi, Yongbin Liu, Peng Zhang, Jie Tang, XLore: A Large-scale English-Chinese Bilingual Knowledge Graph, Proceedings of the ISWC 2013 Posters & Demonstrations Track,  CEUR-WS.org 2013 CEUR Workshop Proceedings, 2013, Sydney , Australia

Xavier Lluís, Xavier Carreras, Lluís Màrquez, Joint Arc-factored Parsing of Syntactic and Semantic Dependencies, Transactions of the Association for Computational Linguistics (TACL), 2013-05, 1, 219-230,

Borja Balle, Xavier Carreras, Franco M. Luque, Ariadna Quattoni., Spectral Learning of Weighted Automata: A Forward-Backward Perspective, Machine Learning, Special Issue on Grammatical Inference, 2013-10,

Sapena, Emili; Padró, Lluís; Turmo, Jordi, A Constraint-Based Hypergraph Partitioning Approach to Coreference Resolution, Computational Linguistics vol. 39, n. 4, pg. 847–884. , 2013-12, 39 (4), 847–884, 0891-2017,

Alegria, Iñaki; Aranberri, Nora; Fresno, Víctor; Gamallo, Pablo; Padró, Lluís;  San~Vicente, Iñaki; Turmo, Jordi; Zubiaga, Arkaitz, Introducción a la tarea compartida Tweet-Norm 2013: Normalización léxica de tuits en español, “Proceedings of Workshop on Tweet Normalization at SEPLN (Tweet-Norm)”, 2013-09, 36–45, 978-84-695-8349-4, Madrid, Spain

Agić, Željko; Bekavac, Božo, Domain Dependence of Statistical Named Entity Recognition and Classification in Croatian Texts, Proceedings of the 35th International Conference on Information Technology Interfaces (ITI 2013), Proceedings of the 35th International Conference on Information Technology Interfaces (ITI 2013), 2013, 277-283, 1330-1012, “ITI annual conference”, Cavtat, Croatia

Agić, Željko; Bekavac, Božo, Domain-aware Evaluation of Named Entity Recognition Systems for Croatian, CIT. Journal of Computing and Information Technology, CIT. Journal of Computing and Information Technology, 2013, Vol 21, No 3, 195-209, ISSN 1330-1136 ,

Juanzi Li, Zhichun Wang, Xiao Zhang, Jie Tang, Large scale instance matching via multiple indexes and candidate selection, Knowledge Based Systems, 2013-09, 50, 112-120 ,

Zhichun Wang, Juanzi Li, Yue Zhao, Rossi Setchi, Jie Tang, A unified approach to matching semantic data on the Web, Knowledge Based Systems, 2013, 39, 173-184 ,

Marko Tadić, Language Technologiesfor ingesting and retrieving information, 7/8/2013, Use of Language Technologies in the Danube Region, JRC Ispra, Italy

Achim Rettinger, xLiMe – crossLingual crossMedia knowledge extraction, 6/27/2013, LT-Innovate Summit 2013, Brussels, Belgium

Lei Zhang, Achim Rettinger, Michael Färber, Marko Tadić, A Comparative Evaluation of Cross-Lingual Text Annotation Techniques, “Information Access Evaluation. Multilinguality, Multimodality, and Visualization”, “Lecture Notes in Computer Science”, Volume 8138 2013, 124-135, 978-3-642-40801-4 (Print) 978-3-642-40802-1 (Online), 4th International Conference of the CLEF Initiative, CLEF 2013, Valencia, Spain

Xavier Lluís, Xavier Carreras, Lluís Màrquez, Joint Arc-factored Parsing of Syntactic and Semantic Dependencies, 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), Sofia, Bulgaria

Xavier Carreras, Structure Prediction with Low-rank Bilinear Forms, ACL-2013 Workshop on Continuous Vector Space Models and their Compositionality (CVSC), Sofia, Bulgaria

Raphael Bailly, Spectral Learning of Hidden Structure, ICML 2013 Worhsop on Spectral Learning, Atlanta, USA

Raphael Bailly, Xavier Carreras, Ariadna Quattoni, Unsupervised Spectral Learning of Finite-state Transducers, Advances in Neural Information Processing Systems (NIPS), Neural Information Processin Systems 2013, Lake Tahoe, USA

Alicia Ageno, Pere R. Comas, Lluís Padró, Jordi Turmo, The TALP-UPC Approach to Tweet-Norm 2013, Proceedings o the Tweet Normalization Workshop at SEPLN-2013, Tweet Normalization Workshop at SEPLN-2013, Madrid, Spain

Grobelnik, Marko; Bradeško, Luka; Tadić, Marko, XLike project overview, FASSBL9 Proceedings USB, HDJT, 10/7/2013, Late Summer School Methods for the Linguists of the Future / Formal Approaches to South Slavic and Balkan Languages (FASSBL9), Dubrovnik, Croatia

Nenad Tomašev, Jan Rupnik, Dunja Mladenić, The Role of Hubs in Cross-lingual Supervised Document Retrieval, The Role of Hubs in Cross-lingual Supervised Document Retrieval, PAKDD 2013, 4/14/2013, 185-196, 978-3-642-37455-5, Pacific-Asia Conference on Knowledge Discovery and Data Mining, Gold Coast, Australia

Jan Rupnik, Andrej Muhič, Primož Škraba, Cross-Lingual Document Analysis, Cross-Lingual Document Analysis, MLCOGS 2013, 4/11/2013, Third EUCogIII Members Conference, Palma de Mallorca, Spain

Jaka Špeh, Andrej Muhič, Jan Rupnik, Parameter Estimation for the Latent Dirichlet Allocation, Parameter Estimation for the Latent Dirichlet Allocation, INFORMATION SOCIETY – IS 2013, 10/7/2013, A, 164, SiKDD 2013, Ljubljana, Slovenia

Zhixing Li, Siqiang Wen, Juanzi Li, Jie Tang, Peng Zhang(paper), On Modeling Non-linear Topical Dependencies, International Conference of Machine Learning, 2014, ICML2014, Beijing, China

Ageno, Alicia; Comas, Pere R; Padró, Lluís; Turmo, Jordi, The TALP-UPC approach to Tweet-Norm 2013, Proceedings of Workshop on Tweet Normalization at SEPLN (Tweet-Norm), 2013-09, 91–95, Workshop on Tweet Normalization at SEPLN (Tweet-Norm) , Madrid, Spain

Štefanec, Vanja, Srebačić, Matea, Šojat, Krešimir, A Method for Computational Representation of Croatian Morphology,  Language Processing and Intelligent Information Systems, Lecture Notes in Computer Science, Springer, 2013, 7912, 978-3-642-38633-6, 20th International Conference, IIS 2013, Warsaw, Poland

Šojat, Krešimir, Srebačić, Matea, Pavelić, Tin, Tadić, Marko, From Morphology to Lexical Hierarchies, Human Language Technologies as a Challenge for Computer Science and Linguistics, Proceedings of the 6th Language & Technology Conference, 2013, 978-83-932640-3-2, 6th Language & Technology Conference, Poznań, Poland

Agić, Željko, Merkler, Danijela, Berović, Daša, Parsing Croatian and Serbian by Using Croatian Dependency Treebanks, Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages, Association for Computational Linguistics, 2013-10, 22-33, Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages, Seattle, Washington, USA

Lei Hou, Juanzi Li, Xiaoli Li, Jiangfeng Qu, Xiaofei Guo, Ou Hui, Jie Tang , What Users Care About: A Framework for Social Content Alignment, Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI2013), IJCAI/AAAI 2013, 2013,  ISBN 978-1-57735-633-2,  Beijing , China

Mengling Xu, Zhichun Wang, Rongfang Bie, Juanzi Li, Chen Zheng, Wantian Ke, Mingquan Zhou, Discovering Missing Semantic Relations between Entities in Wikipedia, 12th International Semantic Web Conference, Springer, 2013, 1, 673-686,  ISBN 978-3-642-41334-6, Sydney , Australia

Tadić, Marko, Cross-lingual Knowledge Extraction (XLike), Proceedings of the XIV Machine Translation Summit, EAMT, 451, XIV Machine Translation Summit, Nice, France

Raphaël Bailly, Xavier Carreras, Franco M. Luque, Ariadna Quattoni, Unsupervised Spectral Learning of WCFG as Low-rank Matrix Completion, Proceedings of Empirical Methods on Natural Language Processing 2013 (EMNLP), 2013-10, Empirical Methods on Natural Language Processing, Seattle, USA

Zhigang Wang, Zhixing Li, Juanzi Li, Jie Tang and Jeff Z. Pan, Transfer Learning Based Cross-lingual Knowledge Extraction for Wikipedia, The 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013) , ACL2013, Sofia, Bulgaria

Šojat, Krešimir; Merkler, Danijela; Štefanec, Vanja; Srebačić, Matea; Tadić, Marko,  Combining morphological resources for Croatian, 9th Mediterranean Morphology Meeting, Dubrovnik, Croatia

Zhichun Wang, Juanzi Li, Jie Tang, Boosting Cross-Lingual Knowledge Linking via Concept Annotation, Proceedings of the 23rd International Joint Conference on Artificial Intelligence, IJCAI/AAAI 2013, 2013,  ISBN 978-1-57735-633-2, Beijing, China

Zhigang Wang, Zhixing Li, Juanzi Li, Jie Tang, Jeff Z. Pan, Transfer Learning Based Cross-lingual Knowledge Extraction for Wikipedia, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics ( Long Paper), The Association for Computer Linguistics, 1, 641-650, Sofia, Bulgaria

2012

Balle, Borja; Quattoni, Ariadna; Carreras, Xavier (2012) Local Loss Optimization in Operator Models: A New Insight into Spectral Learning. In: Proceedings of the 29th International Conference on Machine Learning (ICML-12). Edinburgh: Omnipress, pp. 1879-1886.

Li, Juanzi (2012) Cross-lingual Knowledge Linking across Wiki Knowledge Bases.In: In Proceedings of the 21st International Conference on World Wide Web (WWW ‘2012). New York: ACM, pp.459-468.

Lösch, Uta; Bloehdorn, Stephan; Rettinger Achim (2012) Graph Kernels for RDF Data. In: Simperl et. al. (eds.) Proceedings of the 9th Extended Semantic Web Conference (ESWC’12). Berlin: Heidelberg: Springer Verlag, pp. 134-148.

Luque, Franco M.; Quattoni, Ariadna; Balle, Borja; Carreras, Xavier (2012) Spectral Learning for Non-Deterministic Dependency Parsing. In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Stroudsburg, PA: ACL, pp. 409-419.

Marimon, Montserrat; Padró, Lluís (2012) A Hybrid Approach to Treebank Construction. Procesamiento del Lenguaje Natural, 49: 139-148.

Padró, Lluís; Stanilovsky, Evgeny (2012) FreeLing 3.0: Towards Wider Multilinguality. In: Proceedings of the Language Resources and Evaluation Conference (LREC 2012). Istanbul: ELRA, pp. 2473: 2479.

Qian Zheng, Juanzi Li, Zhichun Wang, Lei Hou (2012) Co-mention and Context Based Entity Linking. In: Proceedings of the Joint Conference of the Sixth Chinese Semantic Web Symposium and the First Chinese Web Science Conference. Berlin: Heidelberg: Springer Verlag (in press).

Rupnik, Jan; Muhic, Andrej; Skraba, Primož (2012) Multilingual Document Retrieval Through Hub Languages. In: Proceedings of the ITI 2012. 34: 387- 392.

Rupnik, Jan; Muhic, Andrej; Skraba, Primož (2012) Multilingual Document Retrieval Through Hub Languages. In: Proceedings of the 15th International Multiconference on Information Society IS-2012. Ljubljana, 15: 201-204.

Štajner, Tadej (2012) Informal sentiment analysis in multiple domains for English and Spanish. In: Proceedings of 16th International Multiconference Information Society – Conference on Data Mining and Data Warehouses (SIKDD), Ljubljana 2012.

Štajner, Tadej (2012) Cross-lingual named entity extraction and disambiguation. In: Petelin, Dejan; Tavcar, Aleš; Kaluža, Boštjan (eds.) Proceedings of 4th Jožef Stefan International Postgraduate School Students Conference.

Tiedeman, Jörg; Ljubešic, Nikola (2012) Efficient Discrimination Between Closely Related Languages. In: Proceedings of the 24th International Conference on Computational Linguistics, COLING 2012. ACL, pp. 2619–2634.

Trampuš Mitja, Blaž Novak (2012) Internals Of An Aggregated Web News Feed. In: Proceedings of the 15th International Multiconference on Information Society IS-2012. Ljubljana.

Wen, Xubo, Xiaoli Ma, Huan Xia, Juanzi Li (2012) Inferring Public and Private Topics for Similar Events. In: Proceedings of the Joint Conference of the Sixth Chinese Semantic Web Symposium and the First Chinese Web Science Conference. Berlin: Heidelberg: Springer Verlag (in press).

Zhichun Wang, Juanzi Li, Yue Zhao, Rossi Setchi, Jie Tang (2013) A Unified Approach to Matching Semantic Data on the Web. Knowledge-Based Systems, 39: 173-184. (delivered in 2012)