highlights-convert

I maintain this project to convert my Kindle highlights to a webpage, and a few intermediate dataformats(that are easier to parse in code).

The raw format(Kindle HTML export format) is converted to a JSON file and a markdown file, which is further exported to a Jekyll website which can be found here http://tarangshah.com/books.

I also had a few old books I read on Google Play Books that exported highlights in MS Word format. There is a parser for that as well.

Content of the project

Conversion scripts for Google Play Books(Exported to Drive) and Kindle Email export(HTML)
- index.js has the main code
- The main classes for parsing are in KindleConvert.js and GooglePlayConverter.js
- renderer.js and postProcessJson.js are used for the final document generation tasks
Raw and intermediate highlight files
{these were essentially text files and quite small in size, github seemed like the perfect place to store these files. You can find them in the "Raw" and "Results" folders}

Google Play Books highlights/notes conversion

Sync all your highlights to Google Drive, google ensures that your highlights are synced in the docx format
Use Pandoc to convert the docx to html pandoc -f docx -t html -o file.html file.docx
Then use the GooglePlayConverter class to convert to json

Kindle Highlights/Notes conversion

Using the Kindle Android or iOS apps, for each book, export your notes to an email. The kindle app attaches an html file of your highlights
Use this html file and the KindleConverter module to convert the highlights to json

Special thanks to @sawyerh for the kindle-email-to-json package, from which the Kindle Converter was derived. I added location, page number and other small parsing updates, mainly for notes etc.

Instructions

Copy the Raw files in the raw directory
Run ./run.sh
Once you have the jekyll data(bunch of .md files in results/jekyllCollection) copy them to the jekyll website(https://github.com/t27/books/ in the _highlights folder)
the updated highlights should now be available at tarangshah.com/books

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.vscode		.vscode
python		python
raw/Play Books Notes		raw/Play Books Notes
results		results
.gitignore		.gitignore
GooglePlayConverter.js		GooglePlayConverter.js
KindleConverter.js		KindleConverter.js
LICENSE		LICENSE
README.md		README.md
convertDocxToHtml.py		convertDocxToHtml.py
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json
postProcessJson.js		postProcessJson.js
renderer.js		renderer.js
run.sh		run.sh
template.html		template.html
test.json		test.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

highlights-convert

Content of the project

Google Play Books highlights/notes conversion

Kindle Highlights/Notes conversion

Instructions

About

Releases

Packages

Languages

License

t27/highlights-convert

Folders and files

Latest commit

History

Repository files navigation

highlights-convert

Content of the project

Google Play Books highlights/notes conversion

Kindle Highlights/Notes conversion

Instructions

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages