Audio CDs are working well!

2022-08-25 10:17:25 -06:00 · ede1fd22be
parent 91a332ca53
commit ede1fd22be
7 changed files with 91 additions and 63 deletions
--- a/README.md
+++ b/README.md
@ -11,15 +11,22 @@ and then re-encode the content to a compressed format.
 At the time I'm writing this README, it will:
-* ~~Rip audio CDs, look them up in cddb, encode them to VBR MP3, then tag them.~~ A rewrite broke this; I plan to fix it soon.
+* Rip audio CDs, look them up in cddb, encode them to VBR MP3, then tag them. 
  * It also writes a shell script you can modify to quickly change the tags, since this is a pretty common thing to want to do.
 * Rip video DVDs, transcode them to mkv
 ## Requirements
-* HandBrakeCLI
-* cdparanoia
-* cd-discid
-* 
+The requirements are fairly light: a few CD tools, cdparanoia, HandBrakeCLI, and some
+DVD libraries.
+
+Most notably, you do *not* need a relational database (SQLite, Postgres, MySQL).
 You just need a file system.
 For a complete list of requirements,
 look at the [Dockerfile](Dockerfile) 
 to see what Debian packages it installs.
 ## How To Run This
@ -34,9 +41,8 @@ Mine is `/srv/ext/incoming`.
        -v /srv/ext/incoming:/incoming \
        registry.gitlab.com/dartcatcher/media-sucker/media-sucker
-I can't get it to work with docker swarm.
-Presumably some magic is happening with `--device`.
+I can't get it to work with docker swarm,
+which doesn't support `--device`.
 It probably has something to do with selinux.
 Stick a video DVD or audio CD in,
 and the drive should spin up for a while,
@ -46,9 +52,14 @@ or a new directory of `.mp3` files (for audio).
 You can watch what it's doing at http://localhost:8080/
 ## A note on filenames and tags
 This program does the absolute minimum to try and tag your media properly.
 Partly because I'm a lazy programmer,
 but mostly because the computer can only guess at things that you,
 the operator,
 can just read off the box.
 For DVDs, that means reading the "title" stored on the DVD,
 which I've seen vary from very helpful (eg. "Barbie A Fashion Fairytale")
@ -62,13 +73,10 @@ so CDDB takes the length of every track in seconds and tries to match that
 against something a user has uploaded in the past.
 This is wrong a whole lot of the time.
 If CDDB can't find a match for an audio CD,
 this program will append the datestamp of the rip to the album name,
 in the hopes that you can remember about what time you put each CD in the drive.
 So for stuff like multi-CD audiobooks, that's pretty helpful.
 But the end result in almost every case is that you're going to have to
-manually edit the metadata.
+rename the movie file, or re-tag the audio files.
 This is why you get a `tag.sh` file with every audio CD rip.
 ## Answers
@ -76,35 +84,23 @@ I'm skipping the part where I make up questions I think people might have.
 ### Why I Wrote This
-The `automatic-ripping-machine` looks really badass.
+The automatic-ripping-machine looks really badass.
 But after multiple attempts across multiple months
 to get it running,
 I decided it would probably be faster just to write my own.
-This isn't as cool as the aumomatic-ripping-machine.
+media-sucker isn't as cool as the automatic-ripping-machine.
 But, at least for me,
-it's a lot more functional,
-in that it actually does something.
+it's more useful,
+in that I can get it to actually do something.
 ### Why You Should Run This
 The only reason I can think of that anybody would want to use this is if they,
 like me,
-are too dumb to get the `automatic-ripping-machine` to work.
+are too dumb to get the automatic-ripping-machine to work.
 ### What Kind Of Hardware I Use
 I run it on a Raspberry Pi 4,
 with a Samsung DVD drive from the stone age.
 ## Parting note
 As of 2022-08-22, large sections of this code were written under COVID brain-fog.
 This means it's going to look a lot like a 13-year-old wrote it.
 I hope one day to clean it up a bit,
 but it's working fairly well, 
 despite the mess.
 Please don't judge me for the organization of things.
 Judge bizarro universe Neale instead.
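
Reviewer note on the README text above: it says CDDB matches discs by track length. As a rough illustration, per-track durations could be derived from a `cd-discid`-style line like the one `scan()` stores in `state["discid"]`. This sketch assumes the usual layout (disc ID, track count, one frame offset per track, total length in seconds) and 75 frames per second. `track_durations` and the sample line are hypothetical, not part of this commit.

```python
FRAMES_PER_SECOND = 75  # CD audio is addressed in 1/75-second frames


def track_durations(discid_line):
    """Return per-track durations in seconds from a cd-discid style line.

    Assumed layout: <id> <ntracks> <offset 1> ... <offset n> <total seconds>
    """
    fields = discid_line.split()
    num_tracks = int(fields[1])
    offsets = [int(f) for f in fields[2:2 + num_tracks]]
    total_frames = int(fields[-1]) * FRAMES_PER_SECOND
    # Each track ends where the next one begins; the last ends at disc end.
    ends = offsets[1:] + [total_frames]
    return [(end - start) / FRAMES_PER_SECOND for start, end in zip(offsets, ends)]


# Made-up sample line: three tracks on a 600-second disc.
example = "abcd1234 3 150 15150 30150 600"
print(track_durations(example))
```

Note that `encode()` in this commit sums the raw fields as `durations`; for progress reporting, offset differences like these may be closer to the real track lengths.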
--- a/src/cd.py
+++ b/src/cd.py
@ -25,6 +25,7 @@ def scan(state, device):
    )
    discid = p.stdout.strip()
    state["discid"] = discid
    cddb_id = discid.split()[0]
    # Look it up in cddb
    email = os.environ.get("EMAIL") # You should really set this variable, tho
@ -44,10 +45,9 @@ def scan(state, device):
        for k in ("title", "artist", "genre", "year", "tracks"):
            state[k] = disc[k]
    else:
        now = time.strftime("%Y-%m-%dT%H%M%S")
        num_tracks = int(discid.split()[1])
-        state["title"] = "Unknown CD - %s" % now
-        state["tracks"] = ["Track %02d" % i for i in range(num_tracks)]
+        state["title"] = "Unknown CD - %s" % cddb_id
+        state["tracks"] = ["Track %02d" % (i+1) for i in range(num_tracks)]
 def copy(state, device, directory):
@ -84,9 +84,20 @@ def copy(state, device, directory):
 def encode(state, directory):
    track_num = 1
    total_tracks = len(state["tracks"])
    durations = [int(d) for d in state["discid"].split()[2:-1]]
    total_duration = sum(durations)
    encoded_duration = 0
+    tag_script = io.StringIO()
+    tag_script.write("#! /bin/sh\n")
+    tag_script.write("\n")
+    tag_script.write("ALBUM=%s\n" % state["title"])
+    tag_script.write("ARTIST=%s\n" % state.get("artist", ""))
+    tag_script.write("GENRE=%s\n" % state.get("genre", ""))
+    tag_script.write("YEAR=%s\n" % state.get("year", ""))
+    tag_script.write("\n")
    for track_name in state["tracks"]:
        logging.debug("Encoding track %d (%s)" % (track_num, track_name))
        duration = durations[track_num-1]
@ -97,8 +108,13 @@ def encode(state, directory):
            "--disptime", "1",
            "--preset", "standard",
            "--tl", state["title"],
-            "--tn", "%d/%d" % (track_num, len(state["tracks"])),
+            "--tn", "%d/%d" % (track_num, total_tracks),
        ]
+        tag_script.write("id3v2")
+        tag_script.write(" --album \"$ALBUM\"")
+        tag_script.write(" --artist \"$ARTIST\"")
+        tag_script.write(" --genre \"$GENRE\"")
+        tag_script.write(" --year \"$YEAR\"")
        if state.get("artist"):
            argv.extend(["--ta", state["artist"]])
        if state.get("genre"):
@ -107,11 +123,16 @@ def encode(state, directory):
            argv.extend(["--ty", state["year"]])
        if track_name:
            argv.extend(["--tt", track_name])
            tag_script.write(" --song \"%s\"" % track_name)
            outfn = "%02d - %s.mp3" % (track_num, track_name)
        else:
            outfn = "%02d.mp3" % track_num
        argv.append("track%02d.cdda.wav" % track_num)
        argv.append(outfn)
        tag_script.write("\\\n    ")
        tag_script.write(" --track %d/%d" % (track_num, total_tracks))
        tag_script.write(" \"%s\"\n" % outfn)
        p = subprocess.Popen(
            argv,
            cwd = directory,
@ -129,10 +150,14 @@ def encode(state, directory):
        encoded_duration += duration
        track_num += 1
    with open(os.path.join(directory, "tag.sh"), "w") as f:
        f.write(tag_script.getvalue())        
 def clean(state, directory):
-    pass
-
+    for fn in os.listdir(directory):
+        if fn.endswith(".wav"):
+            os.remove(os.path.join(directory, fn))
 if __name__ == "__main__":
    import pprint
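
Reviewer note on the `tag.sh` generation in `encode()`: the script writes `ALBUM=%s` without quoting, so a title containing spaces (such as the "Unknown CD - ..." fallback) would be split by the shell. One possible approach, sketched here on the assumption that quoting is wanted, is to pass each value through `shlex.quote`. `tag_script_header` is a hypothetical helper, not code from this commit.

```python
import shlex


def tag_script_header(state):
    """Build the variable block of a tag.sh-style script with quoted values."""
    lines = ["#! /bin/sh", ""]
    for var, key in (("ALBUM", "title"), ("ARTIST", "artist"),
                     ("GENRE", "genre"), ("YEAR", "year")):
        # shlex.quote wraps the value so spaces and quotes survive the shell.
        lines.append("%s=%s" % (var, shlex.quote(str(state.get(key, "")))))
    return "\n".join(lines) + "\n"


header = tag_script_header({"title": "Unknown CD - 940aac0d"})
print(header)
```

The same call could be applied to track names and output filenames, which the commit currently wraps in plain double quotes.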
--- a/src/dvd.py
+++ b/src/dvd.py
@ -43,7 +43,7 @@ def scan(state, device):
        title = lsdvd["provider_id"]
        if title == "$PACKAGE_STRING":
            title = "DVD"
-    now = time.strftime(r"%Y-%m-%dT%H%M%S")
+    now = time.strftime(r"%Y-%m-%dT%H:%M:%S")
    title = "%s %s" % (title, now)
    # Go through all the tracks, looking for the largest referenced sector.
--- a/src/encoder.py
+++ b/src/encoder.py
@ -12,6 +12,7 @@ import re
 import logging
 import dvd
 import cd
 import traceback
 import worker
 class Encoder(worker.Worker):
@ -24,16 +25,20 @@ class Encoder(worker.Worker):
        while True:
            wait = True
            self.status = {"type": "encoder", "state": "idle"}
-            for fn in glob.glob(self.workdir("*", "state.json")):
-                self.encode(os.path.dirname(fn), obj)
+            for fn in glob.glob(self.workdir("*", "sucker.json")):
+                directory = os.path.dirname(fn)
+                state = self.read_state(directory)
+                try:
+                    self.encode(directory, state)
+                except Exception as e:
+                    logging.error("Error encoding %s: %s" % (directory, e))
+                    logging.error(traceback.format_exc())
                wait = False
            if wait:
                time.sleep(12)
-    def encode(self, directory, obj):
+    def encode(self, directory, state):
        self.status["state"] = "encoding"
        state = self.read_state(directory)
        self.status["title"] = state["title"]
        if state["video"]:
@ -46,6 +51,7 @@ class Encoder(worker.Worker):
            self.status["complete"] = pct
        media.clean(state, directory)
        self.clear_state(directory)
        logging.info("Finished encoding")
--- a/src/mediahandler.py
+++ b/src/mediahandler.py
@ -1,6 +0,0 @@
 class MediaHandler:
    def __init__(self, basedir, state):
        self.basedir = basedir
        self.state = state
    def 
--- a/src/reader.py
+++ b/src/reader.py
@ -31,7 +31,7 @@ CDROM_EJECT = 0x5309
 class Reader(worker.Worker):
    def __init__(self, device, directory):
-        super().__init__(device)
+        super().__init__(directory)
        self.device = device
        self.status["type"] = "reader"
        self.status["device"] = device
@ -66,24 +66,22 @@ class Reader(worker.Worker):
                rv = fcntl.ioctl(self.drive, CDROM_DISC_STATUS)
                try:
                    if rv == CDS_AUDIO:
-                        self.handle(false)
+                        self.handle(False)
                    elif rv in [CDS_DATA_1, CDS_DATA_2]:
-                        self.handle(true)
+                        self.handle(True)
                    else:
                        logging.info("Can't handle disc type %d" % rv)
                except Exception as e:
                    logging.error("Error in disc handler: %s" % e)
                    logging.error(traceback.format_exc())
                self.eject()
-            elif rv in (CDS_TRAY_OPEN, CDS_NO_DISC):
+            elif rv in (CDS_TRAY_OPEN, CDS_NO_DISC, CDS_DRIVE_NOT_READ):
                time.sleep(3)
            else:
                logging.info("CDROM_DRIVE_STATUS: %d (%s)" % (rv, CDS_STR[rv]))
                time.sleep(3)
    def eject(self):
        self.status["state"] = "ejecting"
        for i in range(20):
            try:
                fcntl.ioctl(self.drive, CDROM_LOCKDOOR, 0)
@ -100,16 +98,18 @@ class Reader(worker.Worker):
        state = {}
        state["video"] = video
        if video:
-            media = cd
-        else:
            media = dvd
+        else:
+            media = cd
        media.scan(state, self.device)
        self.status["title"] = state["title"]
        subdir = slugify.slugify(state["title"])
        workdir = self.workdir(subdir)
        os.makedirs(workdir, exist_ok=True)
        self.status["state"] = "copying"
-        for pct in media.copy(device, self.workdir(subdir)):
+        for pct in media.copy(state, self.device, workdir):
            self.status["complete"] = pct
        self.write_state(subdir, state)
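
Reviewer note on the polling loop in `Reader`: the branching on drive and disc status can be sketched as a pure function, which makes it testable without a drive attached. The numeric values are the ones defined in `linux/cdrom.h`, and the kernel header's spelling `CDS_DRIVE_NOT_READY` is used here. `next_action` and its return strings are hypothetical, and the sketch assumes the outer branch (not shown in the hunk) tests for `CDS_DISC_OK`.

```python
# Drive status values (CDROM_DRIVE_STATUS ioctl), per linux/cdrom.h
CDS_NO_INFO = 0
CDS_NO_DISC = 1
CDS_TRAY_OPEN = 2
CDS_DRIVE_NOT_READY = 3
CDS_DISC_OK = 4

# Disc status values (CDROM_DISC_STATUS ioctl), per linux/cdrom.h
CDS_AUDIO = 100
CDS_DATA_1 = 101
CDS_DATA_2 = 102


def next_action(drive_status, disc_status=None):
    """Decide what the polling loop should do; returns a short action name."""
    if drive_status == CDS_DISC_OK:
        if disc_status == CDS_AUDIO:
            return "rip-audio"
        if disc_status in (CDS_DATA_1, CDS_DATA_2):
            return "rip-video"
        return "eject"  # disc type this program does not handle
    # Tray open, no disc, still spinning up, or unknown: wait and poll again.
    return "wait"


print(next_action(CDS_DISC_OK, CDS_AUDIO))
```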
--- a/src/worker.py
+++ b/src/worker.py
@ -1,13 +1,13 @@
 import threading 
 import os
 import json
 import logging
 class Worker(threading.Thread):
    def __init__(self, directory, **kwargs):
        self.directory = directory
        self.status = {
            "state": "idle",
            "directory": directory,
        }
        kwargs["daemon"] = True
@ -17,9 +17,16 @@ class Worker(threading.Thread):
        return os.path.join(self.directory, *path)
    def write_state(self, subdir, state):
-        with open(self.workdir(subdir, "state.json"), "w") as f:
-            json.dump(f, state)
+        logging.debug("Writing state: %s" % repr(state))
+        statefn = self.workdir(subdir, "sucker.json")
+        newstatefn = statefn + ".new"
+        with open(newstatefn, "w") as f:
+            json.dump(state, f)
+        os.rename(newstatefn, statefn)
    def read_state(self, subdir):
-        with open(self.workdir(subdir, "state.json")) as f:
+        with open(self.workdir(subdir, "sucker.json")) as f:
            return json.load(f)
    def clear_state(self, subdir):
        os.unlink(self.workdir(subdir, "sucker.json"))
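
Reviewer note on `write_state`: writing to a `.new` file and renaming it means the encoder's `glob` never picks up a half-written `sucker.json`. A minimal standalone sketch of the same pattern follows. It swaps in `os.replace` for the commit's `os.rename` (an assumption about what is wanted, since `os.replace` also overwrites an existing target on Windows), and `write_state_atomically` is a hypothetical name.

```python
import json
import os
import tempfile


def write_state_atomically(statefn, state):
    """Write state as JSON so readers never see a half-written file."""
    newstatefn = statefn + ".new"
    with open(newstatefn, "w") as f:
        json.dump(state, f)
    # The rename is a single step on POSIX filesystems.
    os.replace(newstatefn, statefn)


with tempfile.TemporaryDirectory() as d:
    fn = os.path.join(d, "sucker.json")
    write_state_atomically(fn, {"title": "Unknown CD", "video": False})
    with open(fn) as f:
        loaded = json.load(f)
    leftovers = os.listdir(d)

print(loaded, leftovers)
```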