AJOU Central Library Repository: 감성분석 자연어처리를 위한 사전 자동화 구축

BROWSE

Special Graduate Schools Graduate School of Information and Communication Technology Intelligent Software 3. Theses(Master)

감성분석 자연어처리를 위한 사전 자동화 구축

DC Field	Value	Language
dc.contributor.advisor	손경아	-
dc.contributor.author	유정현	-
dc.date.accessioned	2022-11-29T03:01:01Z	-
dc.date.available	2022-11-29T03:01:01Z	-
dc.date.issued	2022-02	-
dc.identifier.other	31769	-
dc.identifier.uri	https://dspace.ajou.ac.kr/handle/2018.oak/20645	-
dc.description	학위논문(석사)--아주대학교 정보통신대학원 :지능형소프트웨어,2022. 2	-
dc.description.abstract	최근 인공지능 기술이 발달함에 따라, 자연어 및 이미지 처리 기술이 활성화되고 있다. 특히, 자연어처리 분야에서는, TTS나 음성인식 텍스트, 감성분석의 분야가 두각을 드러내고 있으며, 이 중 감성분석 분야는 논문, 뉴스기사 등 전문가가 작성한 정형화된 형식의 방대한 데이터를 제공받을 수 있다는 점에서 다양한 분야에 활용되고 있다. 그러나, 감성분석 분야는 전통금융, 사회 등 기존에 구축된 방대한 학습사전(Corpus)을 활용하여 감성분석을 진행할 수 있지만, 가상자산과 같은 새로운 분야에 대해서 좋은 성능의 감성분석을 진행하려면, 기존 분야에서 사용되었던 만큼의 방대한 학습사전이 필요하다. 따라서, 본 논문에서는 Apache Airflow라는 Scheduler 기반 Data Orchestration 오픈소스를 활용하여, 하루마다 새로운 분야에 대한 학습사전 분류모델을 적용하며, AWS EC2를 활용해 자동으로 갱신하여, 자연어처리에 사용될 학습 사전을 강화하고자 한다.	-
dc.description.tableofcontents	제 1 장 서론 1 제 2 장 관련 연구 5 제 3 장 배경 지식 7 제 1 절 텍스트 전처리 7 제 2 절 Naive Bayes Bayes's Theorem 8 제 3 절 피어슨 상관계수 9 제 4 장 사전 자동화 시스템 구축 11 제 5 장 실험 결과 13 제 1 절 데이터 수집 및 전처리 13 제 2 절 결과 18 제 6 장 결 론 24 제 7 장 참고문헌 25 제 8 장 Abstract 27	-
dc.language.iso	kor	-
dc.publisher	The Graduate School, Ajou University	-
dc.rights	아주대학교 논문은 저작권에 의해 보호받습니다.	-
dc.title	감성분석 자연어처리를 위한 사전 자동화 구축	-
dc.type	Thesis	-
dc.contributor.affiliation	아주대학교 정보통신대학원	-
dc.contributor.alternativeName	Yoo Jeonghyun	-
dc.contributor.department	정보통신대학원 지능형소프트웨어	-
dc.date.awarded	2022. 2	-
dc.description.degree	Master	-
dc.identifier.localId	T000000031769	-
dc.identifier.uci	I804:41038-000000031769	-
dc.identifier.url	https://dcoll.ajou.ac.kr/dcollection/common/orgView/000000031769	-
dc.subject.keyword	빅데이터	-
dc.subject.keyword	자연어처리	-
dc.description.alternativeAbstract	With the recent development of artificial intelligence technology, natural language and image processing technologies are being activated. In particular, in the field of natural language processing, TTS, voice recognition text, and sentimental analysis stand out, and among them, the field of sentimental analysis is used in various fields in that it can receive a large amount of formal data written by experts such as papers and news articles. However, in the field of sentimental analysis, sentimental analysis can be conducted using the existing vast learning Corpus such as traditional finance and society, but in order to proceed with sentimental analysis of good performance in new fields such as virtual assets, a vast learning corpus is needed. Therefore, this paper intends to strengthen the learning corpus to be used for natural language processing by applying a learning pre-classification model for new fields every day using a scheduler-based Data Orchestra open source called Apache Airflow and automatically updating it using AWS EC2.	-

Appears in Collections:: Special Graduate Schools > Graduate School of Information and Communication Technology > Intelligent Software > 3. Theses(Master)

Files in This Item:: There are no files associated with this item.

Show simple item record

qrcode

트윗하기

License

STATISTICS: Total Visit :5,042,531; Total Download :2,100; Today View :736

AJOU Central Library Repository는 국립중앙도서관 OAK 보급사업으로 구축되었습니다.

BROWSE

Browse