Coder Social home page Coder Social logo

aourednik / text2landscape Goto Github PK

View Code? Open in Web Editor NEW
12.0 2.0 3.0 31.77 MB

Visualize a corpus of texts as a landscape with the aid of text mining, graph visualization and self-organizing maps

Home Page: https://ourednik.info/maps/2022/02/04/text2landscape-visualize-a-text-in-multiple-spaces-with-r-network-visualization-word-embeddings-principal-component-analysis-and-self-organizing-maps/

text-mining word2vec quanteda ggraph self-organizing-map graph force-directed-graphs

text2landscape's Introduction

Text2Landscape

An R markdown notebook

The core of this repository is an Rmd notebook that allows you to transform a corpus of texts to a landscape by the tools of computer linguistics, statistics and data visualization packages. Download it and open it in RStudio or VSCode to experiment with the code, or to apply it to your own corpus. Forks and pull requests are welcome!

▸ See the DOCUMENTATION.

Introduction

You will find no realistic landscapes prior to the Renaissance. The saints of medieval murals float in a conceptual space informed by hierarchies and symbolic relations; so do those of the Prajñāpāramitā Sūtras. The word "landscape" appears with the Dutch painters of the 15th century. A landscape is a part of the world perceived by a human being at a given moment; an arrangement of features and shapes in a limited space. The Dutch were focused on natural landscapes. Late 20th-century urbanism deals with urban landscapes.

A text can be transformed into a landscape by the tools of computer linguistics, statistics and data visualization packages. Here we work with the example of three texts by the French geographer, writer and anarchist Élisée Reclus: Histoire d'une Montagne, Histoire d'un ruisseau and L'Anarchie.

Visualize the frequencies

The Text as a Network of Word Co-occurrence

Calculate the feature co-occurrence matrix

Visualise the graph

3D variant

Self-organizing map from network

From the feature-coocurrence matrix to a positional matrix with multidemensional scaling

From MDS to SOM

The biofabric layout

Word Embeddings

Principal Component Analysis

Self-organizing Map

text2landscape's People

Contributors

aourednik avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.