🐟
Niko niko nii
  • Stanford + #!/Shabang
  • Land of Lisp, EmacsVerse, or POMDPLand
  • UTC -08:00

Organizations

@stanfordnlp @TalkBank @sisl @stanford-ssi @Shabang-Systems @MODAP @jklsnt

Jemoka/README.md

Yo. Wassup? 👋

I am Houjun Liu (Jemoka, u/Jemoka). I am a student in the SF Bay Area, and I do ML and data stuff with Python, Node+front end things with JS, C++ (but probably Rust nowadays) when it actually matters, and Lisp (simple but refined, guaranteed to blow your mind).

Here are some things I currently do that may interest you:

  • 📢 TalkBank Batchalign: Have you ever wanted your audio transcribed, utterances segmented, and morphology analyzed? Well, now you can. Done at the Psycholinguistics Lab at CMU. Read the paper or use the Python package!
  • 🎤 #!/Shabang | Simon: semantic search—your data + postgres instance + 10 lines of code.
  • 🧠 Longitudinal NACC Data: What happens when you have a lot of very little features, and you are asked to accurately predict Alzheimer's? Transformers go brrrrr. Done at UC Davis Engineering. Take a gander at our paper!
  • 📕 StanfordNLP Stanza: NLP for many human languages; I'm helping out with the model training, speed/performance optimizations, and coreference resolution!
  • 🗻 Stanford SSI Rover: We're sending a rover to Antarctica for automatic surveying!
  • 📋 #!/Shabang | Condution: Awesome Checklist App for Humans and Aliens Alike w/ @zbuster05 @Exr0n @ban-ionic-ohms @TheEnquirer
  • 🌲 Plan of Thoughts: what if we applied Monte Carlo tree search to Tree of Thoughts?
  • ✍️ Blag: My website and Blag, whose data is also driving a textual info extraction project. Check it out!

Here are some epic older projects:

  • 📆 scalandar: scheduling? automated. @thegail @JackHuhs @papayapaya @Jm0rr
  • 🗞 ConDef/Dictembed: Define a thing, just by context! With @zbuster05. Also a paper; you should probably ask me for a copy, though.
  • 🙊 chat-whisper: End-to-end clinical disfluency analysis, but with OpenAI Whisper. Done with CMU PsyLing and the Pittsburgh Supercomputer; now merged into the TalkBank Batchalign project.
  • 🌐 these patches (1, 2) to linux-mbp-wifi: making wifi work for BCM4377 on Linux!!
  • 🕵️‍ MODAP stack: an open-source effort to make fire detection and segmentation easier when drones aren't allowed
  • 🛀 Replier: Talk with Your Transformer Buddy for Counseling. Check it out on ArXiv!
  • 🤖 gregarious: Easy-Peasy Findy of Robot-ies on Tweeties. Also a research paper. You should use this version, which is better maintained.
  • 🗣 PolitiSort: Sort your political stance, with a little LSTM w/ @zbuster05
  • 🧑‍💻 Borg: Rapidly Configure a Lot of Things

And hey, here's a bit about me:

  • 🖋 I think fountain pens are excellent
  • 🛣 Musik nonstop. Techno-pop, baybee
  • 📹 I make videos, whenever I feel like it
  • 🎙 Host(ed?) a podcast with @ban-ionic-ohms. It's kinda fun.
  • ✅ I am seriously interested in GTD and also in just being productive
  • 😂 I tried to do standup. That did not go well.

Pinned

  1. Shabang-Systems/Condution (Public)

    Tasks? Done. That was quick.

    JavaScript 486 21

  2. TalkBank/batchalign2 (Public)

    Tools for language sample analysis.

    Python 15 4

  3. Shabang-Systems/simon (Public)

    AI search: your data + 10 lines of code.

    Python 73 4

  4. .emacs.d (Public)

    OMG OMG OMGOMG emaaacs?

    Emacs Lisp 3 1

  5. blag (Public)

    My blag and knowledgebase.

    HTML

  6. stanfordnlp/stanza (Public)

    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

    Python 7.3k 896