Rahimanuddin's Blog


March 24, 2017 · 1:10 pm


Building Corpus from Telugu Wikipedia Day 2

OK, so Day 2 came a long time after Day 1.

I used this post from After the Deadline to build the corpus.

First, convert the Wikipedia dump XML into individual article files.

Then use the various available tools to work with the corpus text files.
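For illustration, here is a rough sketch of the first step in Python. It is not the script from the After the Deadline post, just one possible way to do the same thing. It assumes the dump uses the standard MediaWiki export layout (`page` elements containing `title`, `ns`, and `revision/text`), and the function name `split_dump` and the numbered output file names are my own choices.

```python
import bz2
import os
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def split_dump(dump_path, out_dir):
    """Write each main-namespace page of a dump to its own text file.

    Hypothetical helper: assumes the MediaWiki export XML layout.
    Returns the number of files written.
    """
    os.makedirs(out_dir, exist_ok=True)
    # Dumps are usually distributed bz2-compressed; plain XML also works.
    opener = bz2.open if dump_path.endswith(".bz2") else open
    count = 0
    with opener(dump_path, "rb") as source:
        # iterparse reads the file incrementally instead of loading it at once.
        for _event, elem in ET.iterparse(source, events=("end",)):
            if local_name(elem.tag) != "page":
                continue
            title, ns, text = None, None, None
            for node in elem.iter():
                name = local_name(node.tag)
                if name == "title":
                    title = node.text
                elif name == "ns":
                    ns = node.text
                elif name == "text":
                    text = node.text
            # Namespace 0 holds articles; skip talk pages, templates, etc.
            if ns in (None, "0") and text:
                count += 1
                path = os.path.join(out_dir, f"{count:07d}.txt")
                with open(path, "w", encoding="utf-8") as out:
                    out.write(f"{title}\n\n{text}")
            # Release the page's contents once it has been written.
            elem.clear()
    return count
```

A call would look like `split_dump("tewiki-latest-pages-articles.xml.bz2", "articles")`, where the dump file name is an assumption about what was downloaded. The files written this way still contain raw wikitext, so the markup has to be stripped before the text is useful as a corpus.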


Filed under Random
