
nelsonjchen / github-wiki-see-rs


🔎 Did you know most GitHub Wikis can't be indexed by search engines? Search Engine Enablement for GitHub Wikis service. 400,000+ GitHub Wikis, now indexable by your favorite search engine.

Home Page: https://github-wiki-see.page

License: MIT License

Dockerfile 0.19% Rust 33.66% HTML 63.83% TypeScript 2.14% JavaScript 0.14% Shell 0.04%
github wiki seo-optimization github-wikis

github-wiki-see-rs's Introduction

GitHub Wiki SEE


As seen at https://github-wiki-see.page by search engines and archivers!

GitHub Wiki Search Engine Enablement is a service to allow GitHub Wikis to be indexed by search engines.

This is a terribly and hastily built service. However, it is usable as an MVP!

This was made in response to community/community#4992, an issue that has been present for 10+ years.

Design

This is designed as a Rust web proxy application. It is not very Rusty and has lots of hackiness. Cleanup, major overhauls, and straightening are much appreciated!

It is designed to run as a simple Docker application on a service such as fly.io and/or Google Cloud Run. Uptime and latency are important so that the service appears on search engines and gets a high ranking.

301/302/307/308 redirects are intentionally not used, so that search engines do not treat the page as a redirect and ignore its content. Humans should see the "content" as a redirect; the robots should not.
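One way to picture the "no redirect status" approach is a handler that always answers 200 and puts a prominent link to the original page in the body. The sketch below is only an illustration under assumptions: the `Response` struct, `render_mirror_page`, and `escape_html` are hypothetical names rather than the service's actual code, and `content_html` is assumed to be already sanitized.

```rust
/// A minimal, hypothetical HTTP response: status code, headers, and body.
#[derive(Debug)]
pub struct Response {
    pub status: u16,
    pub headers: Vec<(String, String)>,
    pub body: String,
}

/// Escape characters that are unsafe inside HTML text and attribute values.
fn escape_html(input: &str) -> String {
    let mut out = String::with_capacity(input.len());
    for c in input.chars() {
        match c {
            '&' => out.push_str("&amp;"),
            '<' => out.push_str("&lt;"),
            '>' => out.push_str("&gt;"),
            '"' => out.push_str("&quot;"),
            '\'' => out.push_str("&#39;"),
            _ => out.push(c),
        }
    }
    out
}

/// Build a 200 response that carries the mirrored content plus a visible
/// link back to the original page. No `Location` header is set, so a
/// crawler sees ordinary content rather than a redirect.
/// Assumption: `content_html` has already been sanitized by the caller.
pub fn render_mirror_page(original_url: &str, content_html: &str) -> Response {
    let url = escape_html(original_url);
    let body = format!(
        "<!DOCTYPE html>\n<html><body>\n\
         <p><a href=\"{0}\">View the original page on GitHub: {0}</a></p>\n\
         <main>{1}</main>\n\
         </body></html>\n",
        url, content_html
    );
    Response {
        status: 200,
        headers: vec![(
            "Content-Type".to_string(),
            "text/html; charset=utf-8".to_string(),
        )],
        body,
    }
}
```

A human lands on a page whose most visible element is the link to GitHub, while a crawler receives a normal 200 page with indexable content.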

All links rendered in the tool that go outside of GitHub are tagged with rel="nofollow ugc" to prevent ranking manipulation, which is probably one of the reasons wiki content was excluded from indexing.
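The link-tagging rule can be sketched as a small function that decides the `rel` value from a link's target. This is a sketch under assumptions, not the service's actual logic: `rel_for_link` is a hypothetical name, relative links are assumed to stay on the same site, and `github.com` plus its subdomains are assumed to count as "inside GitHub".

```rust
/// Decide the `rel` attribute for a rendered link.
/// Returns `Some("nofollow ugc")` for links that leave GitHub and `None`
/// for relative links or links whose host is github.com or a subdomain.
/// Assumption: subdomains such as gist.github.com count as GitHub.
pub fn rel_for_link(href: &str) -> Option<&'static str> {
    let lower = href.trim().to_ascii_lowercase();

    // Strip the scheme; anything without one is treated as a relative link.
    let rest = if let Some(r) = lower.strip_prefix("https://") {
        r
    } else if let Some(r) = lower.strip_prefix("http://") {
        r
    } else if let Some(r) = lower.strip_prefix("//") {
        r
    } else {
        return None;
    };

    // The authority ends at the first '/', '?' or '#'.
    let authority = rest
        .split(|c: char| c == '/' || c == '?' || c == '#')
        .next()
        .unwrap_or("");
    // Drop any userinfo ("user@") and port (":443") to isolate the host.
    let host_port = authority.rsplit('@').next().unwrap_or("");
    let host = host_port.split(':').next().unwrap_or("");

    if host == "github.com" || host.ends_with(".github.com") {
        None
    } else {
        Some("nofollow ugc")
    }
}
```

Comparing the parsed host, rather than searching the whole URL for "github.com", keeps look-alike hosts such as `github.com.evil.example` on the tagged side.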

A Cloudflare Worker sits in front of the service as an additional safeguard against accidentally mirroring content that is already indexable on GitHub. When the original content isn't indexable, the worker also copies the original's "last modified" date onto the proxied response where possible, which better hints at content freshness to search engines and makes better use of their crawl budget.
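The header enrichment step can be sketched as a pure function. This is only a rough illustration written in Rust for consistency with the rest of the project, not the worker's own code: `last_modified_for_proxy` is a hypothetical name, and how the worker determines indexability is not modeled here, so it is passed in as a flag.

```rust
/// Pick the `Last-Modified` value to attach to the proxied response.
/// Returns `None` when the original page is indexable (the proxied copy
/// should not be served as content in that case) or when the original
/// response carried no usable date.
/// Assumption: the caller has already decided `original_indexable`.
pub fn last_modified_for_proxy(
    original_indexable: bool,
    original_last_modified: Option<&str>,
) -> Option<String> {
    if original_indexable {
        return None;
    }
    original_last_modified
        .map(str::trim)
        .filter(|value| !value.is_empty())
        .map(String::from)
}
```

The date is passed through unchanged, so whatever format the original response used is what the crawler sees.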

Decommissioning

Please see:

https://github-wiki-see.page/#decommissioning

But basically, if GitHub lets a wiki be indexed, this service will 308 redirect to it.

github-wiki-see-rs's People

Contributors

bluscream, dependabot[bot], nelsonjchen

github-wiki-see-rs's Issues

Anchor links not working

Fixing #345 resulted in anchor links showing up in the index. Maybe good, maybe bad. Either way, the anchor links don't work for some reason.

Redirect to wiki page?

It would be cool if clicking on the see.page link automatically redirected you to GitHub

Gradually Decommission known indexable GitHub Wikis

  • Use BigQuery to locate all wikis
  • Scan/check every wiki's Home page to see which are not blocked by the header tag
  • Burn in list of wikis that have indexing enabled. The list is small enough to fit in the binary.
  • Redirect users/search engines automatically to GitHub where indexing is enabled.
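The last two steps of this plan could look roughly like the sketch below: a list compiled into the binary and a routing function that redirects when a wiki is on it. Everything here is an assumption for illustration: the list entries are placeholders, and `INDEXABLE_WIKIS`, `Action`, and `route` are hypothetical names rather than the project's actual code.

```rust
/// Hypothetical burned-in list of "owner/repo" wikis known to be indexable
/// on GitHub, stored in lowercase. The entries are placeholders.
static INDEXABLE_WIKIS: &[&str] = &["example-org/example-repo", "example-user/docs"];

/// What the service should do with an incoming request path.
#[derive(Debug, PartialEq)]
pub enum Action {
    /// Send users and crawlers to GitHub with the given status code.
    Redirect { status: u16, location: String },
    /// Keep serving the mirrored page.
    ServeMirror,
}

/// Route a path of the form "/owner/repo/wiki[/Page]".
pub fn route(path: &str) -> Action {
    let trimmed = path.trim_start_matches('/');
    let mut parts = trimmed.splitn(4, '/');

    // Only wiki paths with a non-empty owner and repo are candidates.
    let (owner, repo) = match (parts.next(), parts.next(), parts.next()) {
        (Some(o), Some(r), Some("wiki")) if !o.is_empty() && !r.is_empty() => (o, r),
        _ => return Action::ServeMirror,
    };

    // Compare case-insensitively against the compiled-in list.
    let key = format!("{}/{}", owner, repo).to_ascii_lowercase();
    if INDEXABLE_WIKIS.iter().any(|wiki| *wiki == key.as_str()) {
        Action::Redirect {
            status: 308,
            location: format!("https://github.com/{}", trimmed),
        }
    } else {
        Action::ServeMirror
    }
}
```

A linear scan is enough for a list small enough to burn into the binary; a sorted slice with binary search or a perfect hash would be a natural swap if the list grew.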

Thank you

I just want to say "Thank you!" for this project! :-)

My wiki pages now appear in search engines. The first visitors came from https://github-wiki-see.page/ , which is how your project came to my attention.

🚨🚨🚨 isaacs/github#1683 viewers! That issue is closed. See this GitHub Discussion on Wiki Indexing on GitHub's feedback discussion forum.

Viewers of isaacs/github#1683 (comment)

See community/community#4992

This issue is just a stub to redirect.

You are first result in Google

Hey, thanks for pushing forward on indexing wikis! I wonder how you got ranked so quickly without having any backlinks to your site?
