Extend y-websocket provider to support sub docs synchronization in one websocket connection

LeeSanity · August 5, 2022, 2:23am

The native implementation of y-websocket supports only one doc per room. When sub documents are nested in a doc, you have to open a websocket connection for each sub document. This may be unacceptable on the server side, because there can be too many connections at the same time. So I extended y-websocket by adding the doc guid (of the main document or a sub document) to the sync message; other clients or servers can then pick the right doc to update when they receive a message.

Here is my implementation (see the multidoc branch):
y-multidoc-websocket

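Client usage is as follows:

// Y is the yjs module; WebsocketProvider is the extended provider from the multidoc branch
const mainDoc = new Y.Doc()
const provider = new WebsocketProvider(host, roomname, mainDoc)

// hand every loaded sub document to the provider so it syncs over the same connection
mainDoc.on('subdocs', ({ added, removed, loaded }) => {
  loaded.forEach(subdoc => {
    provider.addSubdoc(subdoc)
  })
})

// assuming the sub document was stored under 'sub' in the default root map
const subDoc = mainDoc.getMap().get('sub')
subDoc.load()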

// after sync finished, you can fetch data out of the sub document
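
The idea of tagging each message with a doc guid, described at the top of this post, can be sketched without any dependencies. This is only an illustration: the one-byte length prefix and the names `encodeFrame`, `decodeFrame` and `routeFrame` are hypothetical stand-ins, not the lib0 varint/varstring encoding that y-websocket actually uses.

```javascript
// Simplified stand-in for the encoding used by y-websocket (assumption).
// Frame layout in this sketch: [type: 1 byte][guid length: 1 byte][guid bytes][payload]
const MESSAGE_SYNC = 0

function encodeFrame (messageType, docGuid, payload) {
  const guidBytes = new TextEncoder().encode(docGuid)
  if (guidBytes.length > 255) throw new Error('guid too long for this sketch')
  const frame = new Uint8Array(2 + guidBytes.length + payload.length)
  frame[0] = messageType
  frame[1] = guidBytes.length
  frame.set(guidBytes, 2)
  frame.set(payload, 2 + guidBytes.length)
  return frame
}

function decodeFrame (frame) {
  const messageType = frame[0]
  const guidLen = frame[1]
  const docGuid = new TextDecoder().decode(frame.subarray(2, 2 + guidLen))
  const payload = frame.subarray(2 + guidLen)
  return { messageType, docGuid, payload }
}

// Receiver side: look up the document that the frame addresses.
function routeFrame (frame, docsByGuid) {
  const { docGuid, payload } = decodeFrame(frame)
  const target = docsByGuid.get(docGuid)
  if (target === undefined) return null // unknown doc: ignored in this sketch
  return { target, payload }
}
```

With a `Map` from guid to doc on each side, the main document and all of its sub documents can share one connection, because every frame says which doc it belongs to.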

Gin-Quin · August 8, 2022, 8:31am

I am very interested in this. For now I have one connection per document, so the number of connections can grow very fast.

LeeSanity · August 8, 2022, 9:50am

Yeah, my application had the same problem before; I fixed it this way.

douira · October 28, 2022, 7:11pm

Hi! This is very interesting for my use case too. The documentation mentions

Providers (e.g. y-websocket, y-indexeddb) are responsible for syncing subdocuments. Not all providers support subdocuments yet.

This seems to imply they would receive native support for subdocuments in the future. Do you think your implementation is similar to how this would be implemented by the package officially?

Since I am integrating with synced storage and Vue, I wonder how explicitly loading subdocuments would work with the reactivity mechanism (especially since the subdocument is effectively missing/empty until it’s loaded).

Since my application involves users collaborating on potentially multiple documents, I would also like to use awareness to display cursor positions and participants. Do you think splitting awareness up such that each user only gets the relevant awareness data for the documents they are participating in is possible? For example, if I have 100 documents but one particular user is only using 1 or 2 of them, I don’t want all the cursor and user activity of the users of all the other documents to be synchronized with everybody.

LeeSanity · November 4, 2022, 8:10am

This seems to imply they would receive native support for subdocuments in the future. Do you think your implementation is similar to how this would be implemented by the package officially?

Sorry, I haven’t seen the official implementation, but I think the basic idea is the same. Yjs already provides most of the basic APIs, so you can implement your own functions on top of these basic parts.

Do you think splitting awareness up such that each user only gets the relevant awareness data for the documents they are participating in is possible?

Sure, you can do that. Awareness is also an independent module, so you can customize it to your needs!
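
One possible shape for such a customization is a server-side filter that forwards only the awareness states relevant to a given connection. This is a minimal sketch, not part of y-websocket or y-protocols: the function name and the `docGuid` field inside each awareness state are hypothetical conventions.

```javascript
// Hypothetical convention: each client's awareness state carries a `docGuid`
// field naming the document that client is currently working in.
// `states` is a Map of clientId -> state; `subscribedGuids` is a Set of doc guids.
function filterAwarenessStates (states, subscribedGuids) {
  const visible = new Map()
  states.forEach((state, clientId) => {
    if (state !== null && subscribedGuids.has(state.docGuid)) {
      visible.set(clientId, state)
    }
  })
  return visible
}
```

A user who has opened only one or two of 100 documents would then receive cursor and presence data only for those documents. Another option would be to keep a separate awareness instance per sub document.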

folencao · November 28, 2022, 1:35am

@LeeSanity - I appreciate your efforts on this; it is exactly what I am looking for (i.e. one connection that supports multiple docs). I will do some experiments with your code/branch.

Regarding the official network provider that @douira mentioned: after reading the code, I think it doesn’t support multiple docs for now.

folencao · April 25, 2023, 3:36am

@LeeSanity - I think your code has a bug: the if condition seems wrong. If a second user accesses that subdoc, sync step 1 won’t be triggered.

github.com

smoothnlp/y-websocket/blob/multidoc/bin/utils.js#L198

switch (messageType) {
  case messageSync:
    let targetDoc = doc
    const docGuid = decoding.readVarString(decoder)
    if (docGuid !== doc.name) {
      // subdoc
      targetDoc = getYDoc(docGuid, false)
      if (!targetDoc.conns.has(conn)) targetDoc.conns.set(conn, new Set())

      /** @type {Map<String, Boolean>} */
      const subm = subdocsMap.get(doc.name)
      if (subm && subm.has(targetDoc.name)) {
        // sync step 1 done before.
      } else {
        if (subm) {
          subm.set(targetDoc.name, targetDoc)
        } else {
          const nm = new Map()
          nm.set(targetDoc.name, targetDoc)
          subdocsMap.set(doc.name, nm)
        }

if (subm && subm.has(targetDoc.name)) {
  // sync step 1 done before.
}
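
One possible way to address this, assuming the problem is that the bookkeeping is keyed by the main doc name only and is therefore shared by all connections, is to track sync step 1 per connection. The sketch below is not the fix from the repository; `syncedGuidsByConn`, `shouldSendSyncStep1` and `forgetConnection` are hypothetical names.

```javascript
// conn -> Set of doc guids for which sync step 1 was already sent on that connection
const syncedGuidsByConn = new Map()

// Returns true exactly once per (conn, docGuid) pair.
function shouldSendSyncStep1 (conn, docGuid) {
  let guids = syncedGuidsByConn.get(conn)
  if (guids === undefined) {
    guids = new Set()
    syncedGuidsByConn.set(conn, guids)
  }
  if (guids.has(docGuid)) return false
  guids.add(docGuid)
  return true
}

// Call when a connection closes so that stale entries do not accumulate.
function forgetConnection (conn) {
  syncedGuidsByConn.delete(conn)
}
```

With this bookkeeping, a second user opening the same subdoc uses a different connection object, so the check returns true for that user as well.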

LeeSanity · April 26, 2023, 4:09am

Yeah, you are right! It was a quick POC, done in one day. Could you open an issue in the repo? I will fix it later. Thanks!

minchopm · May 24, 2023, 8:57pm

@LeeSanity Do you have any suggestions on how to add and remove subdocs directly from the backend and send an event to the user?

LeeSanity · May 29, 2023, 8:25am

Based on my understanding, you can add/remove subdocs the same way as on the client side. To keep it simple, you can treat the backend as a client!

minchopm · September 5, 2023, 1:13pm

@LeeSanity I have implemented the solution, but once the browser is refreshed, only the root document is synced; the subdocs exist in the document as empty structures without content. Any suggestions on how to solve this?

LeeSanity · September 8, 2023, 4:47am

In the root document, subdocs are only references; you must initialize the subdocs yourself.
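
A minimal sketch of what that initialization could look like after a page refresh, assuming `mainDoc` is a Y.Doc and `provider` exposes the `addSubdoc` method from the multidoc branch discussed in this thread. The function name `syncAllSubdocs` is hypothetical, and the code is written against those two objects only, so it has no imports of its own.

```javascript
function syncAllSubdocs (mainDoc, provider) {
  mainDoc.on('subdocs', ({ added, loaded }) => {
    // Subdocs arriving with the root document are only references: request loading.
    added.forEach(subdoc => subdoc.load())
    // Once a subdoc is marked as loaded, hand it to the provider for syncing.
    loaded.forEach(subdoc => provider.addSubdoc(subdoc))
  })
  // Subdocs that were already present before the listener was attached.
  mainDoc.getSubdocs().forEach(subdoc => {
    if (subdoc.shouldLoad) {
      provider.addSubdoc(subdoc)
    } else {
      subdoc.load()
    }
  })
}
```

Calling this right after creating the provider means every subdoc reference that shows up once the root document has synced is loaded and then synced over the same connection.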

jiangxiaoqiang · April 12, 2025, 4:04am

What is the relationship between Y.Doc subdocs and WebsocketProvider subdocs?

jiangxiaoqiang · May 2, 2025, 8:18am

I have implemented subdocument support following this discussion. This is the repo: texhub-broadcast/src/websocket/conn/socket_io_client_provider.ts at main · RedDwarfTech/texhub-broadcast · GitHub. I hope this can give you a clue.