Jump to content
  • Advertisement
Sign in to follow this  

words

This topic is 3405 days old which is more than the 365 day threshold we allow for new replies. Please post a new topic.

If you intended to correct an error in the post then please contact us.

Recommended Posts

I have an algorithm that produces sequences of characters. Now I want to filter out those sequences that can be considered words of a natural language. My first idea was to google each word and interpret google's result page. As the algorithm needed several seconds to come up with a new sequence of characters, that seemed practical. But now I have improved the algorithm, and it spits out hundreds of thousands of sequences per second. Clearly google would detect my searches as DOS attacks, so I need another approach :) Any ideas?

Share this post


Link to post
Share on other sites
Advertisement
Quote:
Original post by Dancin_Fool
Couldn't you just use a dictionary file and search within that?

No, because dictionary words seem to be too unlikely to come out of my generating process. I am now trying markov chains of length 4 on a large text to determine if a subsequence of characters is legal.

Do these sound like words to you? :)

frietiantuti
racleeremovi
robsorteriel
smotestencer

Share this post


Link to post
Share on other sites
Quote:
Original post by swiftcoder
~240,000 words in newline delimited format.

Thanks, I invented some awesome new words based on that:

amicoelidiac
aristeosaltry
chindrumbalad
hochenwiseler
isomonabletta
machistocutie
pursenselberd
splathratheos

Share this post


Link to post
Share on other sites
Quote:
Original post by DevFred
amicoelidiac
aristeosaltry
chindrumbalad
hochenwiseler
isomonabletta
machistocutie
pursenselberd
splathratheos


Please consider using this truly wonderful invention to make a "fake word of the day" screensaver (possibly including markov-chain-generated definitions).

Share this post


Link to post
Share on other sites
Quote:
Original post by DevFred
Quote:
Original post by swiftcoder
~240,000 words in newline delimited format.

Thanks, I invented some awesome new words based on that:
amicoelidiac
aristeosaltry
chindrumbalad
hochenwiseler
isomonabletta
machistocutie
pursenselberd
splathratheos
Awesome! Any details on what exactly your generator is doing?

Share this post


Link to post
Share on other sites
Sign in to follow this  

  • Advertisement
×

Important Information

By using GameDev.net, you agree to our community Guidelines, Terms of Use, and Privacy Policy.

Participate in the game development conversation and more when you create an account on GameDev.net!

Sign me up!