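The big wave of medical NLP may have arrived in 2019
I am planning to make the application of natural language processing to radiology reports the theme of my medical PhD.
This post is a by-product of my literature survey.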
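Searching for international conference papers on medical NLP
Even in the middle of the current machine learning boom, natural language processing is a field whose application to medicine tends to lag behind.
So where do things stand now?
I listed, at my own discretion, every paper that looked related to medicine among those published since 2015 at ACL, NAACL, EMNLP, CoNLL, IJCNLP, COLING, EACL, and LREC.
Specifically, I searched the ACL Anthology exhaustively for papers whose titles contain any of the search terms below, then excluded those that are clearly unrelated to medicine.
Search terms: 'medic', 'biomedic', 'clinic', 'health', 'life', 'care', 'pharma', 'hospital', 'drug', 'surg', 'ICU', 'emergen', 'patient', 'disease', 'symptom', 'illness', 'radiolog', 'x-ray', 'CT', 'MRI', 'radiograph', 'tomograph', 'magnetic'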
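The title filter described above can be sketched roughly as follows. This is a minimal illustration, not the script actually used for this post: the function names `matched_terms` and `filter_candidates` are hypothetical, and case-insensitive substring matching is an assumption (the post does not say whether the search was case-sensitive). Titles are assumed to be already available as plain strings.

```python
# Search terms copied from the list above.
SEARCH_TERMS = [
    'medic', 'biomedic', 'clinic', 'health', 'life', 'care', 'pharma',
    'hospital', 'drug', 'surg', 'ICU', 'emergen', 'patient', 'disease',
    'symptom', 'illness', 'radiolog', 'x-ray', 'CT', 'MRI', 'radiograph',
    'tomograph', 'magnetic',
]


def matched_terms(title, terms=SEARCH_TERMS):
    """Return the search terms found in the title.

    Assumption: matching is a case-insensitive substring test.
    """
    lowered = title.lower()
    return [term for term in terms if term.lower() in lowered]


def filter_candidates(titles, terms=SEARCH_TERMS):
    """Keep (title, hits) pairs for titles with at least one matching term.

    The result is only a candidate list; it still needs manual review.
    """
    candidates = []
    for title in titles:
        hits = matched_terms(title, terms)
        if hits:
            candidates.append((title, hits))
    return candidates


if __name__ == "__main__":
    sample_titles = [
        "Learning to Summarize Radiology Findings",
        "Joint CTC/attention decoding for end-to-end speech recognition",
        "Neural Machine Translation with Attention",  # hypothetical title
    ]
    for title, hits in filter_candidates(sample_titles):
        print(hits, title)
```

Under this assumption, short terms such as 'CT' or 'life' also hit titles like "Joint CTC/attention decoding ..." that have nothing to do with medicine, which is why a manual exclusion pass is needed after the automatic search.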
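A sudden surge in 2019
First, a summary of the paper counts.
As of this writing, no 2019 proceedings other than ACL and NAACL have been published, yet the ACL and NAACL totals alone already far exceed the counts of previous years.
It is clear that medical NLP has rapidly gained popularity this year.
In particular, the workshops on medical NLP and on applications to psychiatry held at NAACL probably had a considerable influence.
Needless to say, please keep in mind that some of these conferences are not held every year.
(The momentum in 2019 is remarkable!!)
Below, I simply list the papers.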
2015
ACL-IJCNLP 2015
Who caught a cold ? - Identifying the subject of a symptom
Disease Event Detection based on Deep Modality Analysis
Sieve-Based Entity Linking for the Biomedical Domain
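Notably, the first two papers above both come from the Komachi Lab at Tokyo Metropolitan University. Getting two of these three medical NLP papers accepted at ACL, back in the very early days of the field, is truly admirable.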
NAACL 2015
Matching Citation Text and Cited Spans in Biomedical Literature: a Search-Oriented Approach
Extracting Information about Medication Use from Veterinary Discussions
EMNLP 2015
Key Concept Identification for Medical Information Retrieval
Adapting Phrase-based Machine Translation to Normalise Medical Terms in Social Media Messages
#SupportTheCause: Identifying Motivations to Participate in Online Health Campaigns
CoNLL 2015
(No medical NLP articles)
2016
ACL 2016
Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation
Recurrent neural network models for disease name recognition using domain invariant features
Vector-space topic models for detecting Alzheimer’s disease
Identifying Potential Adverse Drug Events in Tweets Using Bootstrapped Lexicons
DeepLife: An Entity-aware Search, Analytics and Exploration Platform for Health and Life Sciences
EMNLP 2016
Structured prediction models for RNN based sequence labeling in clinical text
CoNLL 2016
(No medical NLP articles)
COLING 2016
Appraising UMLS Coverage for Summarizing Medical Evidence
Adverse Drug Reaction Classification With Deep Neural Networks
A Hybrid Approach to Generation of Missing Abstracts in Biomedical Literature
LREC 2016
Sieve-based Coreference Resolution in the Biomedical Domain
Staggered NLP-assisted refinement for Clinical Annotations of Chronic Disease Events
A Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers
BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains
Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages
Automatic Biomedical Term Polysemy Detection
A Corpus of Clinical Practice Guidelines Annotated with the Importance of Recommendations
Identification of Drug-Related Medical Conditions in Social Media
Transfer-Based Learning-to-Rank Assessment of Medical Term Technicality
A Large Rated Lexicon with French Medical Words
The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine
Towards Using Social Media to Identify Individuals at Risk for Preventable Chronic Illness
Managing Linguistic and Terminological Variation in a Medical Dialogue System
A Corpus of Word-Aligned Asked and Anticipated Questions in a Virtual Patient Dialogue System
Annotating Named Entities in Consumer Health Questions
Age and Gender Prediction on Health Forum Data
Monitoring Disease Outbreak Events on the Web Using Text-mining Approach and Domain Expert Knowledge
On Developing Resources for Patient-level Information Retrieval
Annotating and Detecting Medical Events in Clinical Notes
Text Segmentation of Digitized Clinical Texts
The PsyMine Corpus - A Corpus annotated with Psychiatric Disorders and their Etiological Factors
QUEMDISSE? Reported speech in Portuguese
Semantic Relation Extraction with Semantic Patterns Experiment on Radiology Reports
Medical Concept Embeddings via Labeled Background Corpora
2017
ACL 2017
Joint CTC/attention decoding for end-to-end speech recognition
Lifelong Learning CRF for Supervised Aspect Extraction
Computational Characterization of Mental States: A Natural Language Processing Approach
Negotiation of Antibiotic Treatment in Medical Consultations: A Corpus Based Study
Automating Biomedical Evidence Synthesis: RobotReviewer
Life-iNet: A Structured Network-Based Knowledge Exploration and Analytics System for Life Sciences
Olelo: A Question Answering Application for Biomedicine
Semedico: A Comprehensive Semantic Search Engine for the Life Sciences
EMNLP 2017
CoNLL 2017
Neural Domain Adaptation for Biomedical Question Answering
Idea density for predicting Alzheimer’s disease from transcribed speech
IJCNLP 2017
Extraction of Gene-Environment Interaction from the Biomedical Literature
Learning to Diagnose: Assimilating Clinical Narratives using Deep Reinforcement Learning
Identifying Empathetic Messages in Online Health Communities
PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts
Language-Independent Prediction of Psycholinguistic Properties of Words
WiseReporter: A Korean Report Generation System
CYUT at IJCNLP-2017 Task 3: System Report for Review Opinion Diversification
EACL 2017
Multitask Learning for Mental Health Conditions with Limited Social Media Data
Named Entity Recognition in the Medical Domain with Constrained CRF Models
Psycholinguistic Models of Sentence Processing Improve Sentence Readability Ranking
Structured Learning for Temporal Relation Extraction from Clinical Records
A Computational Analysis of the Language of Drug Addiction
Neural Networks for Joint Sentence Classification in Medical Paper Abstracts
Temporal information extraction from clinical text
2018
ACL 2018
Modeling Naive Psychology of Characters in Simple Commonsense Stories
On the Automatic Generation of Medical Imaging Reports
Enhancing Drug-Drug Interaction Extraction from Texts by Molecular Structure Information
Pushing the Limits of Radiology with Joint Modeling of Visual and Textual Information
Biomedical Document Retrieval for Clinical Decision Support System
Identifying Risk Factors For Heart Disease in Electronic Medical Records: A Deep Learning Approach
Ontology alignment in the biomedical domain using entity definitions and context
PICO Element Detection in Medical Text via Long Short-Term Memory Neural Networks
Coding Structures and Actions with the COSTA Scheme in Medical Conversations
Biomedical Event Extraction Using Convolutional Neural Networks and Dependency Parsing
Domain Adaptation for Disease Phrase Matching with Adversarial Networks
Predicting Discharge Disposition Using Patient Complaint Notes in Electronic Medical Records
A Framework for Developing and Evaluating Word Embeddings of Drug-named Entity
Prediction Models for Risk of Type-2 Diabetes Using Health Claims
Investigating Domain-Specific Information for Neural Coreference Resolution on Biomedical Texts
Toward Cross-Domain Engagement Analysis in Medical Notes
Report of NEWS 2018 Named Entity Transliteration Shared Task
CYUT-III Team Chinese Grammatical Error Diagnosis System Report in NLPTEA-2018 CGED Shared Task
NAACL 2018
Label-Aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition
Explainable Prediction of Medical Codes from Clinical Text
CliCR: a Dataset of Clinical Case Reports for Machine Reading Comprehension
Similarity Measures for the Detection of Clinical Conditions with Verbal Fluency Tasks
Multi-Task Learning Framework for Mining Crowd Intelligence towards Clinical Treatment
Syntactic Patterns Improve Information Extraction for Medical Search
From dictations to clinical reports using machine translation
Towards Generating Personalized Hospitalization Summaries
An automated medical scribe for documenting clinical encounters
Generating Continuous Representations of Medical Texts
RiskFinder: A Sentence-level Risk Detector for Financial Reports
A Report on the Complex Word Identification Shared Task 2018
Modeling Second-Language Learning from a Psychological Perspective
What type of happiness are you looking for? - A closer look at detecting mental health from language
CLPsych 2018 Shared Task: Predicting Current and Future Psychological Health from Childhood Essays
A Psychologically Informed Approach to CLPsych Shared Task 2018
RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses
A Report on the 2018 VUA Metaphor Detection Shared Task
EMNLP 2018
Fine-Grained Emotion Detection in Health-Related Online Posts
Lessons from Natural Language Inference in the Clinical Domain
Annotation of a Large Clinical Entity Corpus
emrQA: A Large Corpus for Question Answering on Electronic Medical Records
Structured Multi-Label Biomedical Text Tagging via Attentive Neural Tree Decoding
Hierarchical Neural Networks for Sequential Sentence Classification in Medical Scientific Abstracts
Predicting Factuality of Reporting and Bias of News Media Sources
CARER: Contextualized Affect Representations for Emotion Recognition
Learning Disentangled Representations of Texts with Application to Biomedical Abstracts
Extraction Meets Abstraction: Ideal Answer Generation for Biomedical Questions
UNCC QA: Biomedical Question Answering system
Does it care what you asked? Understanding Importance of Verbs in Deep Learning QA System
Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis
Revisiting neural relation classification in clinical notes with external information
Supervised Machine Learning for Extractive Query Based Summarisation of Biomedical Data
Investigating the Challenges of Temporal Relation Extraction from Clinical Text
De-identifying Free Text of Japanese Dummy Electronic Health Records
Automatically Detecting the Position and Type of Psychiatric Evaluation Report Sections
CAS: French Corpus with Clinical Cases
Analysis of Risk Factor Domains in Psychosis Patient Health Records
Patient Risk Assessment and Warning Symptom Detection Using Deep Attention-Based Neural Networks
Syntax-based Transfer Learning for the Task of Biomedical Relation Extraction
In-domain Context-aware Token Embeddings Improve Biomedical Named Entity Recognition
Listwise temporal ordering of events in clinical notes
Time Expressions in Mental Health Records for Symptom Onset Extraction
Evaluation of a Sequence Tagging Tool for Biomedical Texts
Learning to Summarize Radiology Findings
Overview of the Third Social Media Mining for Health (SMM4H) Shared Tasks at EMNLP 2018
Thumbs Up and Down: Sentiment Analysis of Medical Online Forums
Identification of Emergency Blood Donation Request on Twitter
Dealing with Medication Non-Adherence Expressions in Twitter
Drug-Use Identification from Tweets with Word and Character N-Grams
Automatic Identification of Drugs and Adverse Drug Reaction Related Tweets
Deep Learning for Social Media Health Text Classification
Using PPM for Health Related Text Detection
Leveraging Web Based Evidence Gathering for Drug Information Identification from Tweets
Classification of Tweets about Reported Events using Neural Networks
A Call for Clarity in Reporting BLEU Scores
Findings of the WMT 2018 Biomedical Translation Shared Task: Evaluation on Medline test sets
Translation of Biomedical Documents with Focus on Spanish-English
UFRGS Participation on the WMT Biomedical Translation Shared Task
CoNLL 2018
(No medical NLP articles)
COLING 2018
Improving Feature Extraction for Pathology Reports with Precise Negation Scope Detection
Adaptive Multi-Task Transfer Learning for Chinese Word Segmentation in Medical Text
Document Representation Learning for Patient History Visualization
A Rich Annotation Scheme for Mental Events
An Evaluation of Information Extraction Tools for Identifying Health Claims in News Headlines
The Interplay of Form and Meaning in Complex Medical Terms: Evidence from a Clinical Corpus
A Treebank for the Healthcare Domain
LREC 2018
A FrameNet for Cancer Information in Clinical Narratives: Schema and Annotation
Parallel Corpora for the Biomedical Domain
Medical Entity Corpus with PICO elements and Sentiment Analysis
Annotating Spin in Biomedical Scientific Publications : the case of Random Controlled Trials (RCTs)
Visualization of the occurrence trend of infectious diseases using Twitter
Expert Evaluation of a Spoken Dialogue System in a Clinical Operating Room
A Corpus of Drug Usage Guidelines Annotated with Type of Advice
BioRo: The Biomedical Corpus for the Romanian Language
Biomedical term normalization of EHRs with UMLS
Mining Biomedical Publications With The LAPPS Grid
J-MeDic: A Japanese Disease Name Dictionary based on Real Clinical Usage
BioRead: A New Dataset for Biomedical Reading Comprehension
Medical Sentiment Analysis using Social Media: Towards building a Patient Assisted System
From ‘Solved Problems’ to New Challenges: A Report on LDC Activities
Annotating Reflections for Health Behavior Change Therapy
Construction of the Corpus of Everyday Japanese Conversation: An Interim Report
Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger
2019
NAACL 2019
Neural language models as psycholinguistic subjects: Representations of syntactic state
Sentence Embedding Alignment for Lifelong Relation Extraction
Analyzing the Perceived Severity of Cybersecurity Threats Reported on Social Media
Biomedical Event Extraction based on Knowledge-driven Tree-LSTM
Inferring Which Medical Treatments Work from Reports of Clinical Trials
Augmenting word2vec with latent Dirichlet allocation within a clinical application
Applications of Natural Language Processing in Clinical Research and Practice
A Report on the Third VarDial Evaluation Campaign
A Survey on Biomedical Image Captioning
Proceedings of the 2nd Clinical Natural Language Processing Workshop
Effective Feature Representation for Clinical Text Concept Extraction
An Analysis of Attention over Clinical Notes for Predictive Tasks
Extracting Adverse Drug Event Information with Minimal Engineering
Towards Automatic Generation of Shareable Synthetic Clinical Notes Using Neural Language Models
A Novel System for Extractive Clinical Note Summarization using EHR Data
Study of lexical aspect in the French medical language. Development of a lexical resource
Publicly Available Clinical BERT Embeddings
A General-Purpose Annotation Model for Knowledge Discovery: Case Study in Spanish Clinical Text
Predicting ICU transfers using text messages between nurses and doctors
Medical Entity Linking using Triplet Network
Annotating and Characterizing Clinical Sentences with Explicit Why-QA Cues
Extracting Factual Min/Max Age Information from Clinical Trial Studies
Medical Word Embeddings for Spanish: Development and Evaluation
Probing Biomedical Embeddings from Language Models
Distantly Supervised Biomedical Knowledge Acquisition via Knowledge Graph Based Attention
Browsing Health: Information Extraction to Support New Interfaces for Accessing Medical Evidence
From News to Medical: Cross-domain Discourse Segmentation
Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology
The importance of sharing patient-generated clinical speech and language data
Mental Health Surveillance over Social Media with Digital Cohorts
Trends
Looking over the list, a tendency emerges: within medical NLP there appear to be certain themes that are comparatively easy to work on.