LASI - Old Dominion University

LASI
Linguistic Analysis for Subject Identification
Milestone Presentation
Presented by: CS410 Red Group
3/1/20

Outline
Team Red Staff Chart
Introduction
Problem Statement
LASI in our Case Study
Functional Components
Algorithms
Milestones
Document Parsing
Weighter
GUI Flow
GUI Screenshots
Risk Matrix
Competition Matrix
Conclusion

Team Red Staff Chart
Scott Minter: Project Co-Leader, Software Specialist
Brittany Johnson: Project Co-Leader, Documentation Specialist
Dustin Patrick and Richard Owens: Algorithm Specialist, Expert Liaison, Documentation Specialist, Communication Specialist
Aluan Haddad and Erik Rogers: Algorithm Specialist, Software Specialist, Marketing Specialist, GUI Developer

What is LASI?

LASI: Linguistic Analysis for Subject Identification
(Diagram labels: LASI, THEMES)

LASI Identifies Themes (5 Ws & 1 H)
Who, What, When, Where, Why, How

Why are themes important?
- Comprehension
- Summarization
- Assists in communication between people

Societal Problem
It is difficult for people to identify a common theme over a large set of documents in a timely, consistent, and objective manner.

Our Proposed Solution
LASI is a linguistic analysis decision support tool used to help determine a common theme across multiple documents.
It is our goal with LASI to:
- accurately find themes
- be system efficient
- provide consistent results

What do we mean by linguistic analysis?
The contextual study of written works and how the words combine to form an overall meaning.

Dr. Patrick Hester & Dr. Tom Meyers: The AID Process
Assessment, Improvement, Design
Dr. Hester & Dr. Meyers are systems analysts and researchers for NCSOSE
- Conduct extensive research
- Quickly become familiar with client systems
- Formulate concise, objective assessments
(Photo captions: Dr. Hester, Dr. Meyers)

Before LASI
(Flowchart nodes: Customer Contact; Situational Awareness Meeting; Will NCSOSE be needed? (yes/no); Client Goes Elsewhere; Document Gathering Process; Document Analysis; Problem Statement Presentation; Is the Customer satisfied? (yes/no); Continue on to the rest of the A.I.D Process)

After LASI
(Flowchart nodes: Customer Contact; Situational Awareness Meeting; Will NCSOSE be needed? (yes/no); Client Goes Elsewhere; Document Gathering Process; Document Analysis; Problem Statement Presentation; Is the Customer satisfied? (yes/no); Continue on to the rest of the A.I.D Process)

Major Functional Components

Hardware
High End Notebook PC
- Computation: Quad-Core CPU
- Primary Memory: 8.0 GB DDR3 RAM
- Document Storage: Solid State Storage
~$1500 USD

Software
Algorithm: Extrapolates the most likely congruence of themes and ideas across all documents in the input domain
User Interface:
- Multi-Level Views
- Weighted Phrase List
- Detailed Breakdown
- Step by Step Justification

Linguistic Analysis Algorithm
Primary Analysis: Word Count and Syntactic Assessment
Secondary Analysis: Associative Identification
Tertiary Analysis: Semantic Relationship Assessment
Steps shown on the slide:
- Traverse Document in Word-Wise Manner
- Bind Pronouns to Nouns, Updating Frequency
- Identify Potential Synonyms
- Identify Corresponding Parts of Speech
- Bind Adjectives to Nouns
- Assess Potential Subject-Object-Verb Relationships
- Identify Potential Noun Phrases
- Output List of Weighted Themes
- Determine Frequency by Grammatical Role

LASI Milestones
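A few of the steps listed on the Linguistic Analysis Algorithm slide (word-wise traversal, binding pronouns to nouns, and frequency by grammatical role) can be illustrated with a minimal sketch. This is not the LASI implementation: it is written in Python for brevity rather than the C++ the slides mention, the input is tagged by hand instead of by a real part-of-speech tagger, and the names `ROLE_WEIGHTS` and `weight_themes`, the weight values, and the "bind to the most recent subject noun" heuristic are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical role weights: illustrative values, not taken from the slides.
ROLE_WEIGHTS = {"subject": 3.0, "object": 2.0, "other": 1.0}


def weight_themes(tagged_words):
    """Return (noun, weight) pairs sorted by descending weight.

    tagged_words is a list of (word, part_of_speech, role) tuples, as a
    real tagger might produce; here they are supplied by hand.
    """
    weights = Counter()
    last_subject = None
    for word, pos, role in tagged_words:  # word-wise traversal
        word = word.lower()
        score = ROLE_WEIGHTS.get(role, 1.0)
        if pos == "noun":
            weights[word] += score
            if role == "subject":
                last_subject = word
        elif pos == "pronoun" and last_subject is not None:
            # Naive binding (assumption): credit the pronoun's weight to
            # the most recent noun that appeared in the subject role.
            weights[last_subject] += score
    return weights.most_common()


# "The analyst reads reports. She summarizes reports."
sample = [
    ("The", "det", "other"),
    ("analyst", "noun", "subject"),
    ("reads", "verb", "other"),
    ("reports", "noun", "object"),
    ("She", "pronoun", "subject"),
    ("summarizes", "verb", "other"),
    ("reports", "noun", "object"),
]

themes = weight_themes(sample)
print(themes)  # [('analyst', 6.0), ('reports', 4.0)]
```

The pronoun "She" adds its subject weight to "analyst", so "analyst" outranks "reports" even though "reports" appears more often as a literal word. A real system would need genuine parsing and coreference resolution in place of the hand-tagged input and the single-heuristic binding.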

Document Parsing

Weighter

GUI Flow

Splash Screen

New Project Screen

Results Page

Risk Matrix
Customer Risks
C1 -- Product Interest
C2 -- Maintenance
C3 -- Trust
Technical Risks
T1 -- System Limitations
T2 -- Scanned Text Recognition
T3 -- Jargon Recognition
T4 -- Illegal Character Handling

Customer Risks
C1. Product Interest
Probability 2, Impact 4
Mitigation: LASI offers unique functionality and user-friendliness.
C2. Maintenance
Probability 3, Impact 2
Mitigation: LASI will be a free, open source application, allowing the community to maintain and extend it over time.
C3. Trust
Probability 3, Impact 3
Mitigation: LASI will provide a step by step breakdown of output analysis and algorithm reasoning.

Technical Risks
T1. System Limitations
Probability 4, Impact 2
Mitigation: LASI will be designed from the ground up in native C++ for memory- and CPU-efficient code.
T2. Scanned Text Recognition
Probability 4, Impact 3
Mitigation: LASI will implement an optical character recognition algorithm to handle scanned text.
T3. Jargon Recognition
Probability 3, Impact 2
Mitigation: LASI will have domain specific dictionaries and feature intuitive contextual inference.
T4. Illegal Character Handling
Probability 4, Impact 2
Mitigation: LASI will provide contextual inference, synonym recognition, and statistical methods.
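The probability and impact scores on the risk slides can be combined into a single ranking. The sketch below uses one common convention, exposure = probability × impact; the slides do not say how the team combined the two scores, so the formula and the names `RISKS` and `rank_risks` are assumptions for illustration. Only the score values come from the slides.

```python
# (probability, impact) pairs as listed on the Customer and Technical Risks slides.
RISKS = {
    "C1 Product Interest": (2, 4),
    "C2 Maintenance": (3, 2),
    "C3 Trust": (3, 3),
    "T1 System Limitations": (4, 2),
    "T2 Scanned Text Recognition": (4, 3),
    "T3 Jargon Recognition": (3, 2),
    "T4 Illegal Character Handling": (4, 2),
}


def rank_risks(risks):
    """Sort risks by exposure (probability * impact), highest first.

    The product is an assumed scoring convention, not the slides' method.
    Ties keep the order in which the risks were listed.
    """
    scored = [(name, prob * impact) for name, (prob, impact) in risks.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


for name, exposure in rank_risks(RISKS):
    print(f"{exposure:>2}  {name}")
```

Under this convention T2 (Scanned Text Recognition) ranks highest with an exposure of 12, followed by C3 (Trust) at 9.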

The Competition

Conclusion
- There is a need for LASI
- LASI is an algorithm-heavy program
- Success is beneficial to anyone needing to analyze large sets of documents in a timely, consistent and objective manner
