Though Ivaylo Strandjev's answer does seem to be the common definition, I believe that common definition hasn't done justice by blurring the definition of PATRICIA. After all, there are already umbrella terms for this common definition.
I continued to search for a specific definition from past reputable sources to confirm what I had suspected, and I'm writing to provide my findings. Perhaps the most significant, though the least precise in terms of logical language is the original paper defining PATRICIA, published by DR Morrison in October 1968s "Journal of the ACM":
PATRICIA evolved from "A Library Automaton" [3] and other studies. ...
Early in this evolution it was decided that the alphabet should be
restricted to a binary one. A theorem which strongly influenced this
decision is one which, in another form, is due to Euler. The theorem
states that if the alphabet is binary, then the number of branches is
exactly one less than the number of ends. Corollaries state that as
the library grows, each new end brings into the library with it
exactly one new branch, and each branch has exactly two exits. These
facts are very useful in the allocation of storage for the index. They
imply that the total storage required is completely determined by the
number of ends, and all of the storage required will actually be used.
This seems to contradict points 2 and 3 of the libstdc++ reference. There's further evidence in this paper, though not quite as clear, to suggest that point 4 is also inaccurate.
I found a page entitled Patricia Variations that states quite clearly, and to the point, in the section named "Standard Patricia":
Patricia creates a binary tree, with one node per string stored. Each
node records a bit offset, and you search for a string by moving down
the tree, inspecting the bit with the given offset at each node, and
branching left or right, depending on whether the bit inspected is
zero or one. The search terminates when the bit offset you are to
inspect stops increasing. This means that you have either looped back
to revisit the node you have just left, or have moved up from a node
to visit one of its ancestors...
This paragraph clearly contradicts points 2 and 3 of the libcstd++ reference. It implies that all nodes are internal nodes, as I suspected, that descendants in a PATRICIA trie won't always share a common prefix (eg. when a branch points back up the trie) which further contradicts point 4(B). Perhaps the most useful aspect of all, in the first paragraph it mentions a very reputable source: "section 6.3 of Knuth Volume 3 (The Art of Computer Programming, by D.E. Knuth)", which I've had a quick glance at. Knuth's description doesn't seem to significantly deviate from Morrison's idea, either.
In conclusion, there's a common, obscure definition which seems to have become a synonym for "radix trie" (and perhaps in some cases eg. libstdc++, "multi-way radix trie"), and a specific, original definition which deviates from the common definition quite significantly. I hope the resources revealed in this answer can be as helpful to other students intrigued by PATRICIA as it has been to me... Peace!