bunkai is a parser for catalogued ROM file names that follow the following common naming conventions.
bunkai does not use regular expressions and supports a variety of edge cases for each supported naming convention. It is ported from the well tested shiratsu-naming Rust library, but provides a more ergonomic unified C# API. Unlike shiratsu-naming
, bunkai does not keep trivia such as malformed TOSEC tag orders and is a lossy parser mainly for scraping purposes.
bunkai is built with the Pidgin parser combinators library.
Bunkai is a work in progress and while not all features available in shiratsu-naming will be implemented (particularly any trivia items like TOSEC warnings), the following features are intended.
- No-Intro
- Scene Number
- BIOS
- Title
- Region
- Language
- Multi Language
- Version
-
Rev
Versions -
v
Single Prefixed Versions -
Version
Single Prefixed Versions withAlt
- Unprefixed
1.x
versions -
Version
prefixed date versions (Redump BIOS versions) - Comma-separated version tags
-
- Release
- Bad Dump
- Redump Disc tag
- Redump Multi-tap tag
- TOSEC
- ZZZ-UNK- (Omitted)
- Demo
- Dates
- Undelimited Dates
- Version
-
Rev
versions -
v
versions - Version in flag
-
- Title
- Degenerate titles missing demo or date
- Unexpected spaces
- Publisher
- by-publisher in Title if ZZZ-UNK
- by-publisher after tags if ZZZ-UNK
- Region
- GoodTools region
- Language
- Multilanguage
- System
- Video
- Copyright
- Media
- Devstatus
- Dump info
- More info
- GoodTools
- Region
- Year
- Translation
-
REV
Revision - Version
-
VWIPX
-
VFinal_
-
Vunknown
-
V x.xx
-
V bX
-
V nn
-
V nnnn
-
V_
-
- Language
- Multilanguage
- Volume
- Dump tags
- Hack tag
- Other
-
PD
-
NTSC
-
PAL
-
NTSC-PAL
-
PAL-NTSC
-