The English Language Must Be Accelerated

Macrotonality & Category Theory

Subtitled: Can A Tempo Be A Modal Space?

In mathematics—based on my limited understanding—Set Theory attempts to identify some structure by the composition of, say, the points that comprise it, whereas Category Theory instead momentarily ignores the specific points of composition and instead views the structure as a single unit (i.e. a human as an individual operating in a social milieu and not solely a compilation of red blood cells, organs, and tissue, etc), and then measures said units by its relation to other local units.

Set Theory, to my mind, is how you would compose music if using something like a piano, where each musical line is broken down into precise notes that have static individual values themselves (12 tones, equally tempered, etc), while Category Theory would probably be useful for composing using a structure like language, where the words are comprising a melodic line, but at the same time the words themselves lack a static individual musical value.

This is basically, at a high level, macrotonality to me—where complex vocalization is viewed as a single wave (or line), and is then in turn measured by its relation to other structures in its orbit. In the macrotonal case, the wave is defined by its time value, which is in this case syllables per minute. That value has a relationship with the normal tempo of speech (which is latent), and it also has a relation with the time values of the other active components (“beats,” “percussion,” “drum loops,” “guitar solos,” etc etc.)

The wave’s relation to time integrates it relationally into the rest of the composition, as opposed to its internal line structure or rhyme scheme. So the wave itself is operating by its own internal logics (it contains line structure and rhyme schemes, human beings have kidneys and skin cells) but within the composition itself it’s identified by its categorical time quotient, which is, again, in this case expressed in syllables per minute, and is relational to other components (as well as latent normalized speech).

In other words—we’ve considered groups of tones as modes, but can’t a range of tempo be a mode as well?

Waves Frozen in Time

‘We come down from Truckee surfing against that sun / As if off a great wave but in the / Wrong direction certainly the wave is frozen / Or just moving so slowly that no one can know / If you've done it though you know the feeling’ Robert Ashley, Foreign Experiences

‘There are no points or positions in a rhizome . . . there are only lines . . . when Glenn Gould speeds up the performance of a piece . . . he’s transforming the musical points into lines.’ Deleuze & Guattari, A Thousand Plateaus

What’s the fulcrum of modernity, the fulcrum of the so-called Western world, the fulcrum of the scientific method, the fulcrum of ostensible rationalism, the fulcrum of sanity as we generally understand it if not this notion that ‘things’ can be broken down?—and not just broken down but disassembled into a more or less infinite regression?

To us every atom is a little explanation, every subatomic particle is a tiny special meaning for us to gossip about—to us every building block of nature is a dead frog to be dissected by high school sophomores, every single conversation is something to be recalled and divided into little apple pies of intent and cause and effect, to place on our window sills overlooking our white picket fences. Musically, this trend expresses itself, at first, through musical notes—being quote-unquote ‘noted out’—notes functioning as tone-spaces, and then, eventually, the trend graduates to a concept of microtonality, where said tone-spaces regress to infinite spectrums, infinite spectrums that still contain infinite points.

But listen. I don't want to, you know, like, get into a whole philosophical discussion behind all of this?—other people have explained it better than I could—I don’t even understand it. But compositionally—there’s an idea that, rather than component parts, we could compose via irreducible waves, waves frozen in time, that, yeah, maybe exhibit some attributes that we can notionally attribute to them, but these attributes, they’re attributes that don’t negate the fact that the waves themselves can’t be disassembled—that they don’t actually contain infinite points. That to disassemble them is to change their essence essentially. They contain no points at all.

Composing Spaceship for Sale

Subtitled: What's Macrotonal?

Composing music to some extent is a philosophical judgment on how we interpret the physical world. You can’t escape this ultimately. Ultimately you either continue with a status quo so to speak, a so-called status quo, or you deviate in some substantial manner or another. How you deviate is your philosophical judgment on the world to some extent. You may think, “Oh, haha, I’m going to make a song. Compose some music!” and think in turn that by doing so it’s just some fun thing. But it’s actually the furthest thing from some fun thing. It’s actually incredibly serious, this composing of music. Either way you make a substantial philosophical judgment on the physical world as you understand it, which inevitably contains the metaphysical world within it (not vice versa as is often supposed). No. These are actually really serious decisions you inevitably are forced to make when composing sound, arranging sound, so-called making music.

Whether we like it or not essentially we have to build things, compose structures, by reducing wave-like phenomena to smaller discrete units. Without discrete units our understanding of the world is essentially impossible. Language is itself a function of the discrete unit! Yet at the same time on some level we remain aware that discrete units are probably lurid fictions. Probably! But who’s to say really? In music, or Western music, the harmonic tradition, these discrete units traditionally are pitched notes, which exist in physical space, which have physical frequencies assigned to them octave by octave, which date all the way back to the invention of the piano and the twelve tone equally tempered scale.

Most of us never question the veracity of this construct. Music is of course a derivation of arranging a series of pitched notes according to very specific mathematical relations that we usually just call harmony. This is composing music for most of us, and most of us will never leave this state of affairs, and most of us probably shouldn’t! The major and minor scales with their notes and chords, those are perfectly fine for many of us! Some of us, however, will follow a path of microtonality, which in a material sense is just following the path of quantum physics (or vice versa). If a note is an atom, the world of the microtone is the musical world of subatomic particles. You know, technically there’s an infinite spectrum of sound between each semitonal, you know! We’ll say things like that when we’re enamored with microtones. You know, just like between 1.9 and 2.0, there are infinite regressions of, like, 1.9999, and 1.999999, and 1.999999997 and shit, there are the same regressions between D and E flat.

This isn’t wrong, but these are the types of things we’ll say if we’re doing the microtone thing. But basically we’re still talking about discrete units. And specifically how we define them! Maybe discrete units aren’t actually discrete units! Because obviously, obviously, at a certain point these infinite regressions of microtones and split atoms start to make us wonder if a discrete unit is even possible. Like we said, discrete units are probably fictions that are used primarily to enable us to understand the world. But at the same time, what if the opposite were true?

What if, theoretically, what if discrete units in fact did exist, but rather than being smaller than atoms and tones, rather than regressing into infinite decimal points of misery, what if the discrete unit of music was actually a fully-formed sequence. That wasn’t reducible to a series of pitched notes per se, but could only be assigned a value in aggregate. (Not entirely different from a pitched note!) Like a tempo. Like a rap verse. What if that was used as a discrete unit, as the basis of a composition of music. A piece of text but recited at an accelerated tempo so that it’s actually musical in nature. Time of course changes things essentially. Tempo is essentially an intensive form of time, like degrees, percentages, and all that.

So if we take just speech itself, but if we change its time value then it becomes something that’s no longer speech as we understand it. Rap music has taught us this in this country, that the English language when accelerated becomes a musical line. And it’s totally cool that rap music still views notes as the discrete units that make up bars that are adjustable in some type of atomic manner. But isn’t it possible that’s fictitious? But not in an infinite regression sense of tiny particles and shit? But in a wave-as-unit type of way? But this has always been the difficulty with the notion of text-as-composition, the operas of Robert Ashley come to mind.

The text can’t be referential. The text must be the point of reference. But in order to be a point of reference a fungible value has to be assigned to the text, which can then inform the subsequent tone and sound elements. If the text is the root of the composition then the text must have a mathematical and/or musical value that can then inform the subsequent elements. If you give me a beat in 4/4 time at 100 BPM with a certain sequence of kicks and snares, I can write 3,000 bars to said beat, but the text will inevitably be in reference to the beat, so while we can say the composition may be text-focused, sure, it’s not a text-as-composition.

Rap music is the most language-focused music perhaps in existence. Yet it’s still not text-as-composition. The elements of the composition, they derive from a fulcrum of a beat. It’s the same if you write in equal bars. You’re writing a beat of vocals. No. That’s not text composition. Language requires a new form of musical measurement to truly become composed. If you take a chunk of measured yet unequal prose and vocalize that, then that vocalization perhaps becomes a discrete unit, not to be memorized and performed over and over. It can be ascribed a mathematical value that can then be translated into a sound milieu.

Syllables per minute is a value. If a piece of text is actualized at 200 syllables per minute, then you could arrange a set of tones at 50 BPM and you’d have a 4:1 ratio of text to derived tones. Tone deriving from text. The syllables then essentially become 16th notes. The text becomes the composition. In short, the voice memos of dialogue-based rap verses are the discrete units used to compose Unique Towels, Moons of Uranus, GILF Sundays and various other compositions eventually to be released in this vein. Yet each, while telling a story in some vaguely traditional linear sense, while even functioning as allegories in some sense, are first and foremost philosophical judgments on the world itself. First and foremost that’s basically what they are.
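The 200-to-50 arithmetic above can be sketched in a few lines. This is only an illustration, assuming the beat is a quarter note (as in 4/4) and whole-number tempos; the function names and the lookup table are hypothetical, not part of any of the compositions mentioned.

```python
from fractions import Fraction

# Hypothetical lookup: syllables per quarter-note beat -> note value.
SUBDIVISIONS = {
    1: "quarter notes",
    2: "8th notes",
    3: "8th-note triplets",
    4: "16th notes",
    8: "32nd notes",
}

def syllables_per_beat(syllables_per_minute, bpm):
    """Ratio of text rate to beat rate, kept exact (integer inputs assumed)."""
    return Fraction(syllables_per_minute, bpm)

def note_value(syllables_per_minute, bpm):
    """Name the note value each syllable occupies, if the ratio is a simple one."""
    ratio = syllables_per_beat(syllables_per_minute, bpm)
    if ratio.denominator == 1 and ratio.numerator in SUBDIVISIONS:
        return SUBDIVISIONS[ratio.numerator]
    return f"{float(ratio):.2f} syllables per beat (no simple note value)"

# The example from the text: 200 syllables per minute against 50 BPM.
print(syllables_per_beat(200, 50))  # 4
print(note_value(200, 50))          # 16th notes
```

A ratio that doesn't reduce to a whole number (say 200 syllables per minute against 90 BPM) falls through to the last branch, which is arguably where the text stops behaving like a subdivision of the beat at all.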

The Inversion of Melisma

“This isn’t spoken word. It’s the reinvention of Sugar Hill.” - Sole

You can’t discuss recitation in America without interfacing with Rap music. I mean. You can. But it would be disingenuous to do so. Not that I’m totally opposed to being disingenuous. There are times when being disingenuous is totally necessary. Just not in this particular case. When I’m discussing music theory and shit.

But what makes rap Rap exactly. No. Let’s. Just this one time. In the service of actually discussing the purely musical components of what we deem quote-unquote “rap.” Let’s strip the subjectivity from the equation completely. Subjectivity is. Honestly? It’s so 20th Century to me. This notion of so-called personal experience. Ugh. It’s so sterile. This is perhaps post-subjectivity.

Anyway. What makes rap Rap? Musically? Well it’s obviously speed. It’s tempo. I mean. Okay. To some extent it’s rhyme. It’s the concept of the bar. These are true. But it’s mostly tempo. It’s speech. But contracted so that it operates at an accelerated pace. Obviously the speech needs to be stylistic. In one way or another. It needs to be good. But beyond that. What chiefly distinguishes rap from. For example. Spoken word poetry. Is that it has an increased tempo. And that tempo has a relationship with a piece of music. Even if it’s an electronic loop (most of the time). Now. Sure. You can make an argument that a slower paced delivery. With a temporal relationship to a beat. That that’s still rap. Sure. I don’t disagree. That’s a valid exception to the rule. People can and do rap at slower tempos.

But what about melisma? Isn’t melisma. From Byzantine chant to the Qurra of the Islamic world to the Gospel singers of America. Isn’t that what people generally view as an apex of sorts? An ecstasy of sorts? Where the signifier of the syllable within the grammatical structure of language gets stretched into pure sound? Becomes perhaps unintelligible. Or at least less intelligible. But. Isn’t the inverse of that process double. Triple. Quadruple time rap? Except rather than an expansion of the signifier into (relative) unintelligibility we have the contraction of the signifier into (relative) unintelligibility? Doesn’t that. Make perfect sense conceptually?

I think it does. The most quote-unquote technical rappers are the ones who. Generally speaking. Are on the faster side. Big Daddy Kane and Myka 9 started this like over thirty years ago now. And the realm of rap is. Whether you like it or not. Where the most advanced recitative singing and/or vocalization is done in the English language. The English language. With its 44 phonemes. And. What? Eleven vowel sounds? Is preternaturally disposed to the contraction of itself. As opposed to the expansion that the Romance languages are. Consonants are everywhere in English.

Yet one place where Rap has. At least very rarely. Dared to go is outside of this concept of bar. The vast (vast!) majority of rap is constructed on this concept. That the relationship between the vocal and the music is one of syncopation on the bar level. This is in the vernacular. The line of the rapper is supposed to match up with the bar of music. Obviously you should rhyme too. But the rhyme should always. Ideally. Land on the same snare. Or kick. Of each line of music. This is essentially a spatial relationship. The lines extend the same length. Length resides in space.

But you could have a temporal relationship too. Right? My idea is that. I don’t know. Maybe you write unequal lines of text. But the vocal and the music exist in a temporal relationship. Now that relationship doesn’t necessarily need to be 1:1. In fact I think it’s better if it’s not. But if you have a 4/4 beat at 90 BPM then you could equate each syllable of text to. Say. A 16th note. Which at 90 BPM would impute 360 syllables per minute rapped. So if you’re rapping at or around that rate. Then you’re in a 4x temporal relationship with the beat.

It’s really that simple! You could increase the BPM of that 4/4 beat to 180 BPM. The vocals can stay static. You’d be at a 2x relationship. Or syllables would be essentially 8th notes. This is audible. Even as the signifier becomes less. Yet in this instance there’s another inversion. There’s an inverted melisma. But compositionally. Realistically. You’re probably setting the BPM based on the vocal. As opposed to selecting a beat and then constructing a verse to rap over it at that set tempo.
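Setting the BPM from the vocal, rather than the other way around, is a one-line calculation once the take has been measured. A minimal sketch, assuming you already have a syllable count and a duration for a recorded take; the 90-syllables-in-15-seconds figure is a made-up example, and both function names are hypothetical.

```python
def measured_rate(syllable_count, seconds):
    """Syllables per minute of a recorded take."""
    return syllable_count * 60.0 / seconds

def bpm_for_vocal(syllables_per_minute, syllables_per_beat):
    """Beat tempo that puts the vocal in the chosen ratio with the beat."""
    return syllables_per_minute / syllables_per_beat

# Hypothetical take: 90 syllables delivered in 15 seconds.
rate = measured_rate(90, 15)
print(rate)                    # 360.0 syllables per minute
print(bpm_for_vocal(rate, 4))  # 90.0  -> syllables land as 16th notes
print(bpm_for_vocal(rate, 2))  # 180.0 -> syllables land as 8th notes
```

The vocal stays fixed in both cases; only the beat moves. A real take won't hold a perfectly steady rate, so the result is better read as a target tempo to round and adjust by ear.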

But to fit these many syllables into a verse? How uneven should they be? I’d personally say they should occupy the 8th interval of the Fibonacci sequence. Sitting somewhere between 34 and 55 syllables. Each line. That gives each line enough variability. But not too much variability. And it packs enough syllables into a single line that velocity can be reached. But there’s still room to. You know. Breathe?

Melisma is the. Extended technique? That brings the signifier of language into. As Charlie Looker notably said. Not into abstraction. But into raw material. Raw sound. There is no longer any representational reference. This is done by slowing. Expanding. By assigning many notes to a single syllable. The inversion of this is the opposite. But circuitously ends at a very similar result. By assigning many syllables to a single note. Quadruple time. The Ison and Byzantine cantor. The text. Of course it’s textual. But it’s. Via melisma. Or the inversion of melisma. It achieves a breaking with the signifier. A text as raw sound. As opposed to signifying representational items. It’s not a coincidence that the inversion of melisma has achieved popularity in America.

In the English language. Melisma never sounds as good in English as it does in. Literally any other language. But especially the Romance languages. The Latin languages. Or the Semitic languages. But rap. The inversion of melisma. It never sounds quite as good in those Romance languages. The vowel-based languages. With fewer phonemes. They can’t stylize the inversion of melisma the way English can. Just as English. With 87 vowel sounds surrounded by infinite consonants. Can never get melisma to quite the technical level of Italian. Or Greek. Or Arabic. Yet this inversion of melisma. I mean. Melisma isn’t a bar-based style. Rap as we understand it today? It’s incapable of truly reaching appropriately unhinged levels of inverted Melisma. Melisma is naturally uneven. So to truly invert melisma. It requires a method to make the lines uneven. But still somehow relate to the specific music as well. Which has been shown here.

Notes on Music (05.01.24)

Classical music as we understand it from Europe de-emphasized the human voice and tempo. The former tendency is somewhat unique. Other ‘classical’ traditions feature the human voice as a - if not the - focal point. Which makes some sense. The cheapest musical instrument is, after all, your voice. But harmonic music, which is essentially European music, which is arguably an extension of a well-tempered scale, eschews tempo as well as voice.

But to be fair you can only focus on so much. And when you have a plethora of complex chords suddenly at your disposal, which themselves can be difficult to achieve even in isolation, never mind to progress in conjunction with other complex chords, then it’s understandable that tempo wouldn’t necessarily be a focal point. Likewise with the human voice. The human voice, unlike the guitar or piano, obviously can’t express two or three or four notes at once. Plus, it’s not naturally well-tempered like other melodic instruments. It’s inveterately microtonal (at least pre auto-tune). It requires not only the writing of notes but also the writing of words to truly compose for it.

If we wanted to oversimplify things we could say that when the temperament of an octave is equal (i.e. 12-TET), then chords become more of an emphasis. And when chords become more of an emphasis the human voice and specific tempo necessarily become de-emphasized.

American pop and rock (which for a time at least was essentially synonymous with American pop) extend in a linear fashion from this emphasis of the chord of European classical music. Of course there are vocals in pop and rock. But the central component of the song is the chord and its progressions. The vocal extends from the chord and not vice versa. Even in rock’s more avant-garde offshoots like punk and metal the chord generally maintains its central location. It’s only when you get to the most extreme iterations, usually in metal, that this shifts at all (and even most death metal, to be fair, is still chord-driven).

Rap, on the other hand, is an (African-)American music that became a popular music but that exists in contradistinction to the European classical model. In ‘traditional’ rap there are often no chords at all. And certainly no progressions. In fact, in traditional rap there are no instruments at all sans the human voice. Only samples of instruments: a drum break, a short looped instrumental passage. A bassline maybe. And then vocals (I’ll leave DJ cuts to the side for now).

Rap is in essence a vocal music. Yet at the same time, as a vocal music, rap also takes into account the peculiar character of the English language. As opposed to, say, trying to mimic Italian opera. Forty-four phonemes (unique sounds) exist in English, as opposed to an average of maybe 25 to 30 for other languages. That’s anywhere from 47 to 76% more unique sounds than the average world language contains. There are 11 vowel sounds. Most other languages have 5 or 6. So give or take 100% additional vowel sounds. All of this is to say that the English language, from a musical standpoint, is an extremely extended scale. It’s like playing guitar on 24 TET instead of 12 TET. Or playing a fretless string instrument as opposed to a well-tempered one. The more an octave expands the more it tends toward melody over harmony.
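The percentages follow directly from the counts given above, and they're easy to check. A quick sketch, taking the essay's own figures (44 phonemes against 25 to 30, 11 vowel sounds against 5 or 6) as given rather than as verified linguistic data; `percent_more` is a hypothetical helper name.

```python
def percent_more(a, b):
    """How much larger a is than b, as a percentage of b."""
    return (a - b) / b * 100

# Phonemes: English (44) against the quoted average range (25 to 30).
print(round(percent_more(44, 30), 1))  # 46.7
print(round(percent_more(44, 25), 1))  # 76.0

# Vowel sounds: English (11) against the quoted 5 or 6.
print(round(percent_more(11, 6), 1))   # 83.3
print(round(percent_more(11, 5), 1))   # 120.0
```

So "give or take 100%" for the vowels sits between the two ends of the range, roughly 83% to 120%.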

Now if English just had more phonemes, but its vowel sounds were traditionally reduced? Then maybe you could fairly easily construct a music that’s less harmonic, more vocal, but more melismatic. Like Ottoman classical music. But the number of vowel sounds and English’s tendency toward hard consonants as opposed to free-flowing mellifluous long words make melisma more of an intrinsic challenge. And with vocal music . . . language must underpin the voice. Which makes the writing of a melismatic music more clunky.

But rap does away with this challenge by removing melisma altogether. No. Rap is a vocal music. Yes. But in place of melismatics it substitutes tempo. Rather than extending a syllable for three or four or five beats it extends the breath across those beats. But then it fills that breath with as many syllables as it can possibly fit.

It allows the hard consonant tendency of English to achieve speed via tempo, as opposed to inviting clunkiness via melismatics. Which isn’t to say there isn’t a melody to rap. Obviously there is. But it’s the melody of the speech. The melody of the mode. A reduced octave (because the octave has expanded). It’s the melody of speech just reimagined at an accelerated tempo.

What I’ve just described could also just as easily describe the American operas of Robert Ashley.

Theory of Self-Similar Composition

Or: Two Forms of Intervals & Jimothy Prits Pragma Blothworth

Rock music like post-screamo and satanic black metal is fun, but composing it requires us to make a few determinations on the procession of time.

One way we can look at time is that 1 beat equals 1 beat, and maybe there will be 4 beats per line? Yes. There will be 4 beats per line. And these 4 beats will be divisible into 4 iterations of 1 beat, and 1 beat will always equal 1 beat. Each beat will, true, comprise 25% of the line (1/4=.25), but 1 beat always equals 1 beat. 1=1

So if we were to take this first way of looking at time and map it out numerically, so we can see how our time is progressing, we should make it as simple as possible. Let's multiply everything by 100, so the first beat starts at 100 instead of 1. This will make it easier for us to track our progression without resorting to decimal points, which everyone hates. So we start at 100. Each beat is 1 beat, but each beat is a fourth of the line, which is 25% (1/4=.25), so each beat adds 25 to the first beat (which is 100), so the first line looks like this:

100+25+25+25+25, or: 100 then 125 then 150 then 175 then (beginning of second line) 200. We're increasing the line incrementally by 1 beat, which is 25% of the line, but 1 beat always equals the same thing, 1 beat. 1=1. So our first two lines proceed as follows: [100-125-150-175]-[200-225-250-275]-[300]...etc, etc

But of course another way we could look at this is that 25% of the line equals 25% of the line. 25%=25%. But how would that look? Any different? Let's start again at the first beat, which we'll start again at 100:

100*(1.25)*(1.25)*(1.25)*(1.25), or: 100 then 125 then 156.25 then 195.3125 then (beginning of second line) 244.1406. When 25% equals 25% our progression, it seems, is no longer distributed in even increments. 1 beat no longer equals 1 beat when 25% equals 25%. Yet, on a mathematical note, when our increments were equal (when 1 beat equaled 1 beat), then our percentages were no longer equal. For example: to get from 100 to 125, you would add 25% (100*1.25=125) to 100. But to get from 125 to 150 you would only add 20% (125*1.20=150)! So if 25%=25% then 1 no longer equals 1. But if 1=1 then 25% no longer equals 25%.

So if 25% equals 25% then our first 2 lines look like this: [100-125-156.25-195.3125]-[244.1406-305.1758-381.4697-476.8372]-[596.0464].

We might look at these intervals and say, "Wow those are random ass numbers, dude—way different than 4/4 time!" Yet is this really the case? In our first way of looking at time 1 beat equaled 1 beat, but 25% didn't always equal 25%. In this case 25% equals 25% but 1 beat doesn't always equal 1 beat.

In mathematical jargon we might say that 100 to 125 to 150...(etc, etc) is a way of proceeding extensively, while 100 to 125 to 156.25...(etc, etc) is a way of proceeding intensively. We might say the first way is a strophic (repetitive) approach to composing, while the second way is a self-similar (fractal) approach to composing.
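Both ways of proceeding are easy to generate and compare side by side. What follows is just a sketch of the arithmetic already worked out above, using the same starting value of 100 and the same 25; the function names are hypothetical labels for the two approaches, not established terms.

```python
def extensive(start, step, beats):
    """Add the same amount each beat: 1 beat always equals 1 beat."""
    return [start + step * i for i in range(beats + 1)]

def intensive(start, ratio, beats):
    """Multiply by the same ratio each beat: 25% always equals 25%."""
    return [start * ratio ** i for i in range(beats + 1)]

def percent_steps(values):
    """Percentage increase from each value to the next."""
    return [round((b / a - 1) * 100, 2) for a, b in zip(values, values[1:])]

ext = extensive(100, 25, 8)                              # two lines of 4 beats
inten = [round(v, 4) for v in intensive(100, 1.25, 8)]

print(ext)    # [100, 125, 150, 175, 200, 225, 250, 275, 300]
print(inten)  # [100.0, 125.0, 156.25, 195.3125, 244.1406, 305.1758, 381.4697, 476.8372, 596.0464]

print(percent_steps(ext)[:4])    # [25.0, 20.0, 16.67, 14.29] -> equal steps, shrinking percentages
print(percent_steps(inten)[:4])  # [25.0, 25.0, 25.0, 25.0]   -> equal percentages, growing steps
```

The last two lines are the whole argument in miniature: hold the increment constant and the percentage drifts, hold the percentage constant and the increment drifts.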

In conclusion, these are two ways of calculating intervals and thinking about the inexorable procession of time while composing music.