advLigoRTS issueshttps://git.ligo.org/cds/software/advligorts/-/issues2020-02-13T23:29:59Zhttps://git.ligo.org/cds/software/advligorts/-/issues/54buffer overflow in daqd2020-02-13T23:29:59ZDavid Barkerbuffer overflow in daqdtransfer from bugzilla bug 779transfer from bugzilla bug 779Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/65FrVect structures in the minute- and second-trend frames do not have values set2020-02-13T23:35:51ZErik von ReisFrVect structures in the minute- and second-trend frames do not have values setFrom bugzilla ticket #1097
Duncan Michael Macleod (Cardiff University) 2017-06-26 09:50:21 PDT
FrVect structures in the minute- and second-trend frames do not have values set
for the following attributes (at least)
- name
- unitX
- u...From bugzilla ticket #1097
Duncan Michael Macleod (Cardiff University) 2017-06-26 09:50:21 PDT
FrVect structures in the minute- and second-trend frames do not have values set
for the following attributes (at least)
- name
- unitX
- unitY
where these are set in the upstream 'raw' frames. It would be nice to have
these set to match those in the upstream, where appropriate (i.e. not unitY for
.n trends)David BarkerDavid Barkerhttps://git.ligo.org/cds/software/advligorts/-/issues/55IOP DACKILL channels in SDF have constantly updating "Time of last change"2020-02-13T23:40:29ZErik von ReisIOP DACKILL channels in SDF have constantly updating "Time of last change"from bugzilla ticket #1002
Keith: 2016-04-08 11:51:44 PDT
Even after the upgrade to RCG 3.0, the IOP DACKILL channels in SDF still show
that the "Time of last change" is constantly updating
- See https://services.ligo-la.caltech....from bugzilla ticket #1002
Keith: 2016-04-08 11:51:44 PDT
Even after the upgrade to RCG 3.0, the IOP DACKILL channels in SDF still show
that the "Time of last change" is constantly updating
- See https://services.ligo-la.caltech.edu/FRS/show_bug.cgi?id=3796
- It was claimed have been fixed in Bug 850
https://bugzilla.ligo-wa.caltech.edu/bugzilla3/show_bug.cgi?id=850Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/57ECAT SYS front ends do not have dcuid defined in the NAME_DCU_ID epics record2020-02-13T23:41:08ZErik von ReisECAT SYS front ends do not have dcuid defined in the NAME_DCU_ID epics recordFrom bugzilla ticket #1026
Dave 2016-06-23 11:17:10 PDT
each front end reports its DCU id using an epics record called NAME_DCU_ID. For
the ECAT SYS front ends this record has a zero value.
e.g.
caget H1:SYS-ECATC1PLC1SDF_DCU_ID
H1:...From bugzilla ticket #1026
Dave 2016-06-23 11:17:10 PDT
each front end reports its DCU id using an epics record called NAME_DCU_ID. For
the ECAT SYS front ends this record has a zero value.
e.g.
caget H1:SYS-ECATC1PLC1SDF_DCU_ID
H1:SYS-ECATC1PLC1SDF_DCU_ID 0Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/69SDF RELOAD_STATUS should be non-zero whenever a load is pending2020-02-14T00:06:05ZDavid BarkerSDF RELOAD_STATUS should be non-zero whenever a load is pendingtransfer from Bugzilla bug 924 (enhancement)transfer from Bugzilla bug 924 (enhancement)Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/72SDF_MON_ALL should turn on monitoring of all settings against current set points2020-02-14T00:52:11ZDavid BarkerSDF_MON_ALL should turn on monitoring of all settings against current set pointstransferred from bugzilla bug 774 (enchancement)transferred from bugzilla bug 774 (enchancement)Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/115useless DAQD dependencies during package build2020-05-11T23:10:00ZJameson Rollinsjameson.rollins@ligo.orguseless DAQD dependencies during package build```
dpkg-shlibdeps: warning: package could avoid a useless dependency if debian/advligorts-daqd/usr/bin/nds debian/advligorts-daqd/usr/bin/daqd were not linked against libframecpp.so.12 (they use none of the library's symbols)
dpkg-shlib...```
dpkg-shlibdeps: warning: package could avoid a useless dependency if debian/advligorts-daqd/usr/bin/nds debian/advligorts-daqd/usr/bin/daqd were not linked against libframecpp.so.12 (they use none of the library's symbols)
dpkg-shlibdeps: warning: package could avoid a useless dependency if debian/advligorts-daqd/usr/bin/daqd was not linked against libca.so.3.15.5 (it uses none of the library's symbols)
dpkg-shlibdeps: warning: package could avoid a useless dependency if debian/advligorts-fe/usr/bin/standalone_edc was not linked against libCom.so.3.15.5 (it uses none of the library's symbols)
dpkg-shlibdeps: warning: package could avoid a useless dependency if debian/advligorts-fe/usr/bin/standalone_edc was not linked against libboost_filesystem.so.1.67.0 (it uses none of the library's symbols)
dpkg-shlibdeps: warning: package could avoid a useless dependency if debian/advligorts-transport-pubsub/usr/bin/cds_pub_sub debian/advligorts-transport-pubsub/usr/bin/cps_recv debian/advligorts-transport-pubsub/usr/bin/cps_xmit were not linked against libboost_system.so.1.67.0 (they use none of the library's symbols)
```Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/132Unify gdstask.c between DAQ and AWG2020-06-26T23:13:17ZErik von ReisUnify gdstask.c between DAQ and AWGgdstask.c exists in src/daqd and src/gds. They are quite similar. They ought to be merged into a library.gdstask.c exists in src/daqd and src/gds. They are quite similar. They ought to be merged into a library.https://git.ligo.org/cds/software/advligorts/-/issues/140The daqd on x2daqtw0 is failing2020-09-01T23:22:03ZJonathan HanksThe daqd on x2daqtw0 is failing<pre>
Jul 23 16:49:46 x2daqtw0 daqd[12008]: Allocated move buffer size 51491864 bytes
Jul 23 16:49:46 x2daqtw0 daqd[12008]: 8
Jul 23 16:49:46 x2daqtw0 daqd[12008]: 9
Jul 23 16:49:46 x2daqtw0 daqd[12008]: 10
Jul 23 16:49:46 x2daqtw0 daqd[...<pre>
Jul 23 16:49:46 x2daqtw0 daqd[12008]: Allocated move buffer size 51491864 bytes
Jul 23 16:49:46 x2daqtw0 daqd[12008]: 8
Jul 23 16:49:46 x2daqtw0 daqd[12008]: 9
Jul 23 16:49:46 x2daqtw0 daqd[12008]: 10
Jul 23 16:49:46 x2daqtw0 daqd[12008]: 11
Jul 23 16:49:46 x2daqtw0 daqd[12008]: 12
Jul 23 16:49:47 x2daqtw0 daqd[12008]: 13
Jul 23 16:49:47 x2daqtw0 daqd[12008]: 14
Jul 23 16:49:47 x2daqtw0 daqd[12008]: 15
Jul 23 16:49:48 x2daqtw0 daqd[12008]: [Thu Jul 23 16:49:48 2020] Minute trender made GPS time correction; gps=1279576205; gps%60=5
Jul 23 16:50:45 x2daqtw0 daqd[12008]: [Thu Jul 23 16:50:45 2020] ->3: clear crc
Jul 23 17:11:10 x2daqtw0 daqd[12008]: Dropped data from shmem or received 0 dcus; gps now = 1279577487, 12; was = 1279577487, 10;
Jul 23 17:11:10 x2daqtw0 daqd[12008]: expected gps = 1279577487
Jul 23 17:11:10 x2daqtw0 daqd[12008]: expected cycle = 11
Jul 23 17:11:10 x2daqtw0 daqd[12008]: expected nano = 11
Jul 23 17:11:11 x2daqtw0 systemd[1]: rts-daqd.service: Main process exited, code=killed, status=11/SEGV
Jul 23 17:11:11 x2daqtw0 systemd[1]: rts-daqd.service: Failed with result 'signal'.
</pre>Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/172Daqd broadcaster sending the wrong data2020-09-23T22:04:51ZJonathan HanksDaqd broadcaster sending the wrong dataThe daqd broadcaster was seen to send the wrong value for at least one channel during the recent lho upgrade.
The guardian state channel was stuck at a value for about a day, despite it changing. The input data stream to the daqd appea...The daqd broadcaster was seen to send the wrong value for at least one channel during the recent lho upgrade.
The guardian state channel was stuck at a value for about a day, despite it changing. The input data stream to the daqd appeared to have the correct value. A restart of the daqd fixed the issue.
Currently we do not know the cause of this issue.Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/178In a current production setup we run the daqd only to get epics variables, th...2020-10-06T23:10:50ZJonathan HanksIn a current production setup we run the daqd only to get epics variables, these could be folded into the cps_recv processThe daqd on a data concentrator is not really needed, and restarting it doesn't impact the down stream data flow. We should move the diagnostics to the receiver process so that we don't need to run a daqd for that.The daqd on a data concentrator is not really needed, and restarting it doesn't impact the down stream data flow. We should move the diagnostics to the receiver process so that we don't need to run a daqd for that.Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/188On a model crash local_dc exited with an error of 'free(): invalid pointer'2020-10-16T00:54:51ZJonathan HanksOn a model crash local_dc exited with an error of 'free(): invalid pointer'See FRS:
When the models crashed local_dc crashed as well. Notice there had been no messages in about the last month.
<pre>
Sep 17 14:05:03 h1susey local_dc[1572]: Model h1sustmsy rate = 16384 dcuid = 99
Sep 17 14:05:03 h1susey local_...See FRS:
When the models crashed local_dc crashed as well. Notice there had been no messages in about the last month.
<pre>
Sep 17 14:05:03 h1susey local_dc[1572]: Model h1sustmsy rate = 16384 dcuid = 99
Sep 17 14:05:03 h1susey local_dc[1572]: Model h1susetmypi rate = 65536 dcuid = 127
Oct 15 16:26:50 h1susey local_dc[1572]: free(): invalid pointer
Oct 15 16:26:50 h1susey systemd[1]: rts-local_dc.service: Main process exited, code=killed, status=6/ABRT
Oct 15 16:26:50 h1susey systemd[1]: rts-local_dc.service: Failed with result 'signal'.
Oct 15 16:26:55 h1susey systemd[1]: rts-local_dc.service: Service RestartSec=5s expired, scheduling restart.
Oct 15 16:26:55 h1susey systemd[1]: rts-local_dc.service: Scheduled restart job, restart counter is at 13.
Oct 15 16:26:55 h1susey systemd[1]: Stopped Advanced LIGO RTS local data concentrator.
Oct 15 16:26:55 h1susey systemd[1]: Started Advanced LIGO RTS local data concentrator.
</pre>
I am noting that cps_xmit also failed. Again note that we saw a large amount of time with no issues prior to this crash.
<pre>
Oct 08 16:35:11 h1susey cps_xmit[19800]: Client running refcount 1
Oct 15 16:26:50 h1susey cps_xmit[19800]: Closing client
Oct 15 16:26:50 h1susey cps_xmit[19800]: Closing client
Oct 15 16:26:50 h1susey cps_xmit[19800]: Closing out OpenMX and exiting
Oct 15 16:26:50 h1susey systemd[1]: rts-transport@cps_xmit.service: Main process exited, code=exited, status=1/FAILURE
Oct 15 16:26:50 h1susey systemd[1]: rts-transport@cps_xmit.service: Failed with result 'exit-code'.
Oct 15 16:26:55 h1susey systemd[1]: rts-transport@cps_xmit.service: Service RestartSec=5s expired, scheduling restart.
Oct 15 16:26:55 h1susey systemd[1]: rts-transport@cps_xmit.service: Scheduled restart job, restart counter is at 1.
Oct 15 16:26:55 h1susey systemd[1]: Stopped Advanced LIGO RTS DAQD transport.
Oct 15 16:26:55 h1susey systemd[1]: Started Advanced LIGO RTS DAQD transport.
Oct 15 16:26:55 h1susey cps_xmit[5413]: /usr/bin/cps_xmit compiled Aug 14 2020 : 16:21:37
Oct 15 16:26:55 h1susey cps_xmit[5413]: Writing DAQ data to local shared memory and sending out on the publisher
Oct 15 16:26:55 h1susey cps_xmit[5413]: ifo mapped to 0x7f5d7c541000
Oct 15 16:26:56 h1susey cps_xmit[5413]: Making mbuff area local_dc with size 104857600
Oct 15 16:26:56 h1susey cps_xmit[5413]: local_dc mmapped address is 0x7f5d7c541000
Oct 15 16:26:56 h1susey cps_xmit[5413]: Client running refcount 1
Oct 15 17:11:22 h1susey cps_xmit[5413]: Closing client
Oct 15 17:11:22 h1susey cps_xmit[5413]: Closing client
Oct 15 17:11:22 h1susey cps_xmit[5413]: Closing out OpenMX and exiting
</pre>Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/186the daqd producer and some of the transmit code needs to do better parameter/...2020-11-11T01:41:46ZJonathan Hanksthe daqd producer and some of the transmit code needs to do better parameter/input handlingcorrupted data blocks could allow buffer overflows if things like the model/dcu count was bad or lengths were bad.corrupted data blocks could allow buffer overflows if things like the model/dcu count was bad or lengths were bad.Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/222awgtpman: testpoint restrictions are antiquated2021-04-05T18:26:20ZErik von Reisawgtpman: testpoint restrictions are antiquatedtpman has some arbitrary limits on testpoint creation that should be eliminated, and some calculations that are sometimes incorrect. This issue perhaps properly should be split into several issues.
Everything is tracked with fixed-widt...tpman has some arbitrary limits on testpoint creation that should be eliminated, and some calculations that are sometimes incorrect. This issue perhaps properly should be split into several issues.
Everything is tracked with fixed-width buffers that should be std library data structures. Number of test points and number of users per testpoint are arbitrarily limited by the size of the buffers used to track them. Sizes of structures are dissociated from structures.
Total data rate calculation is incorrect for slow models < 16 kHz. So the ceiling on data rate is low by maybe a factor of 8.
There is no consideration of global data rate or where a model is. The limit of models in the cornerstation should be greater than from the end station (as long as the ends are combined into a sub-1G pipe). The limiting factor is how much traffic a link can handle *without causing missed daq epochs* for any model. The hard limit in code is probably too high for the ends and certainly too low for the corner.
Finally, and perhaps of no importance, there are nested loops used to check all these fixed structures. The performance of these checks could easily be improved.
Relevant code is in requesttp_1_svc in testpoint_server.chttps://git.ligo.org/cds/software/advligorts/-/issues/238Adjust how some transport programs detect live input buffers to treat them as...2021-06-04T17:10:17ZJonathan HanksAdjust how some transport programs detect live input buffers to treat them as read onlySome of the transmit programs (at least cps_xmit and dix_xmit) determine if a input buffer is live by writing an invalid cycle to the buffer and waiting for a valid cycle number. This is causes issues with some things (tracking buffers ...Some of the transmit programs (at least cps_xmit and dix_xmit) determine if a input buffer is live by writing an invalid cycle to the buffer and waiting for a valid cycle number. This is causes issues with some things (tracking buffers through restarts via mbuf_probe) and is a bad philosophy of doing this, the input can and should be read only. These should be changed to look for a cycle change to check for live buffers.Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/239DAQD package should specify more precisely ldas-tools dependencies2021-06-09T16:14:07ZErik von ReisDAQD package should specify more precisely ldas-tools dependenciesRight now the dependency is not restricted by version, but some older packages no longer work with daqd.
Perhaps we should consider tightening all dependencies in all packages somewhat.Right now the dependency is not restricted by version, but some older packages no longer work with daqd.
Perhaps we should consider tightening all dependencies in all packages somewhat.https://git.ligo.org/cds/software/advligorts/-/issues/242"DAQ LOAD" button took down the whole host2021-06-16T16:38:06ZErik von Reis"DAQ LOAD" button took down the whole hostPressing "DAQ LOAD" on H1PEMMY crashed the whole front end.Pressing "DAQ LOAD" on H1PEMMY crashed the whole front end.https://git.ligo.org/cds/software/advligorts/-/issues/253Add [Install] section to rts-local_dc service2021-08-11T23:00:12ZArtemiy DmitrievAdd [Install] section to rts-local_dc serviceThere is no `[Install]` section in `/lib/systemd/system/rts-local_dc.service` currently. There is no reason not to have it, because this service is required by `daqd` and would normally start at boot.
- Steps to reproduce:
```
$ grep -A...There is no `[Install]` section in `/lib/systemd/system/rts-local_dc.service` currently. There is no reason not to have it, because this service is required by `daqd` and would normally start at boot.
- Steps to reproduce:
```
$ grep -A1 "\[Install\]" /lib/systemd/system/rts-local_dc.service
```
- Expected output:
```
[Install]
WantedBy=multi-user.target
```
- Actual output: nonehttps://git.ligo.org/cds/software/advligorts/-/issues/255Update cmake and code to build both buster and bullseye2021-08-18T22:25:04ZJonathan HanksUpdate cmake and code to build both buster and bullseyeKeith is working on the kernel side of things, however there are still minor userspace/cmake/... changes needed.Keith is working on the kernel side of things, however there are still minor userspace/cmake/... changes needed.Jonathan HanksJonathan Hankshttps://git.ligo.org/cds/software/advligorts/-/issues/233Modify RCG model make, install to work with linux kernel 5.10 / Debian bullseye2021-08-23T21:00:51ZKeith ThorneModify RCG model make, install to work with linux kernel 5.10 / Debian bullseyeUpdate the RCG model make, make install to work with linux kernel 5.10
known items
- update src/fe/timing.g to support change to 64-bit timekeepingUpdate the RCG model make, make install to work with linux kernel 5.10
known items
- update src/fe/timing.g to support change to 64-bit timekeepingKeith ThorneKeith Thorne