verilator

Author	SHA1	Message	Date
Wilson Snyder	b24d7c83d3	Copyright year update	2023-01-01 10:18:39 -05:00
github action	821dd070bf	Apply 'make format'	2022-11-23 09:08:02 +00:00
Yves Mathieu	06fdf7be58	Add support of Events for VCD/FST traces (#3759)	2022-11-23 04:07:14 -05:00
Kamil Rakoczy	d6126c4b32	Remove --no-threads; require --threads 1 for single threaded (#3703).	2022-11-05 08:47:34 -04:00
Geza Lore	38a8d7fb2e	Remove redundant 'inline' keywords from definitions. Also add checks to t/t_dist_cppstyle.	2022-09-16 15:52:25 +01:00
Geza Lore	96a4b3e5a5	Update clang-format config and apply: regroup and sort #include directives (like we used to, but automatic); set AlwaysBreakTemplateDeclarations to true.	2022-08-05 12:00:24 +01:00
Wilson Snyder	a2d26b45bb	Internals: Fix some clang-tidy issues. No functional change intended.	2022-07-30 11:54:28 -04:00
Geza Lore	1d400dd98c	Configure tracing at run-time, instead of compile time (#3504). All remaining uses of conditional compilation in the tracing implementation of the run-time library are replaced with the use of VerilatedModel::traceConfig, so trace configuration is now done at run-time.	2022-07-20 11:27:10 +01:00
Geza Lore	a4ed3c2086	Make parallel tracing switchable at run-time	2022-07-19 17:13:13 +01:00
Geza Lore	efb5caad22	Improve robustness of trace configuration. Always fail if adding a model to a trace file that has already executed a dump. We used to do this before as well, though in a less robust way. We will be relying on this property more in the future, so improve the check.	2022-07-19 14:16:08 +01:00
Geza Lore	db59c07f27	Implement trace offloading with fewer ifdefs. Step towards a proper run-time library. Reduce the amount of ifdefs in the implementation of offloaded tracing. There are still a very small number of ifdefs left, which will need more careful changes in order to keep user API compatibility.	2022-07-19 11:31:35 +01:00
Geza Lore	9085e34d70	Pass VerilatedModel at trace registration time	2022-07-19 11:00:09 +01:00
Geza Lore	f4038e3674	Move thread pool and execution profiler into the context. (#3477) Fixes #3454	2022-07-12 11:41:15 +01:00
Geza Lore	b51f887567	Perform VCD tracing in parallel when using --threads (#3449). VCD tracing is now parallelized using the same thread pool as the model. We achieve this by breaking the top level trace functions into multiple top level functions (as many as --threads), and after emitting the time stamp to the VCD file on the main thread, we execute the tracing functions in parallel on the same thread pool as the model (which we pass to the trace file during registration), tracing into a secondary per thread buffer. The main thread will then stitch (memcpy) the buffers together into the output file. This makes the `--trace-threads` option redundant with `--trace`, which now only affects `--trace-fst`. FST tracing uses the previous offloading scheme. This obviously helps a lot in VCD tracing performance, and I have seen better than Amdahl speedup, namely I get 3.9x on XiangShan 4T (2.7x on OpenTitan 4T).	2022-05-29 19:08:39 +01:00
Geza Lore	c4b8675d77	Always inline some small, hot trace routines	2022-05-28 12:47:09 +01:00
Geza Lore	a7cd7a1ed9	Initialize VerilatedTrace members in class	2022-05-28 12:47:07 +01:00
Geza Lore	551bd284dd	Rename some internals related to multi-threaded tracing. Rename the implementation internals of current multi-threaded tracing to be "offload mode". No functional change, nor user interface change intended.	2022-05-20 16:44:35 +01:00
Wilson Snyder	f6035447ae	Internals: Use mutable for mutexes. No functional change.	2022-05-13 07:21:39 -04:00
Wilson Snyder	e02f97854c	Deprecate 'vluint64_t' and similar types (#3255).	2022-03-27 15:27:40 -04:00
Wilson Snyder	321880f5a6	Add trace dumpvars() call for selective runtime tracing (#3322).	2022-03-05 15:44:32 -05:00
Wilson Snyder	50094ca296	Internals: Add cpplint control file and related cleanups	2022-01-09 16:49:38 -05:00
Wilson Snyder	ca42be982c	Copyright year update.	2022-01-01 08:26:40 -05:00
Geza Lore	ff425369ac	Reduce .rodata footprint of trace initialization (#3250). Trace initialization (tracep->decl* functions) used to explicitly pass the complete hierarchical names of signals as string constants. This contains a lot of redundancy (path prefixes), does not scale well with large designs and resulted in .rodata sections (the string constants) in ELF executables being extremely large. This patch changes the API of trace initialization to allow pushing and popping name prefixes as we walk the hierarchy tree, which are prepended to declared signal names at run-time during trace initialization. This in turn allows us to emit repeated path/name components only once, effectively removing all duplicate path prefixes. On SweRV EH1 this reduces the .rodata section in a --trace build by 94%. Additionally, trace declarations are now emitted in lexical order by hierarchical signal names, and the top level trace initialization function respects --output-split-ctrace.	2021-12-19 15:15:07 +00:00
Wilson Snyder	ab13a2ebdc	Internals: Use C++11 const and initializers. No functional change intended.	2021-07-24 08:36:11 -04:00
Àlex Torregrosa	2b2680770b	Improve scope types in FST and VCD traces (#2805).	2021-04-07 09:55:11 -04:00
Wilson Snyder	ca01d6f18d	Internals: Add some std::'s. No functional change intended.	2021-03-26 21:23:18 -04:00
Wilson Snyder	2e158d88c1	Commentary. Remove dox comments from private members.	2021-03-20 21:11:53 -04:00
Wilson Snyder	a1ab295b74	Commentary: Cleanup all include/* header comments.	2021-03-20 17:46:00 -04:00
Wilson Snyder	3a55600913	Internals: Restyle with C++11 'using', replacing typedef	2021-03-12 18:10:45 -05:00
Wilson Snyder	2cad22a22a	Add simulation context (VerilatedContext) (#2660). (#2813). Add simulation context (VerilatedContext) to allow multiple fully independent models to be in the same process. Please see the updated examples. Add context->time() and context->timeInc() API calls, to set simulation time. These are now recommended in place of the legacy sc_time_stamp().	2021-03-07 11:01:54 -05:00
Wilson Snyder	caa9c99837	Commentary	2021-03-07 08:28:13 -05:00
Wilson Snyder	8c3ad591ae	Internals: Add additional mutex exclusion checks. No functional change.	2021-03-06 18:29:11 -05:00
Wilson Snyder	be31fdcfe4	Use Google-style-guide header guard naming, to avoid __ prefix.	2021-03-03 21:57:07 -05:00
Wilson Snyder	9650aefa42	Internals: Cleanup unneeded {}. No functional change	2021-02-21 21:25:21 -05:00
Àlex Torregrosa	e77e4e1fe6	Improve struct scopes when dumping structs to VCD (#2776)	2021-02-03 14:40:21 -05:00
Wilson Snyder	bd602d0e2d	Copyright year update	2021-01-01 10:29:54 -05:00
Wilson Snyder	7d05be802d	Misc internal coverage hole and related bug fixes	2020-12-09 19:18:12 -05:00
Wilson Snyder	b6ded59c2b	Internals: Use and enforce class final for ~5% performance boost.	2020-11-18 21:32:16 -05:00
Wilson Snyder	78aee6f4e7	C++11: Use sized enums (+4% performance).	2020-08-16 12:05:35 -04:00
Wilson Snyder	72d2cff0a1	C++11: Use member declaration initializations. No functional change intended.	2020-08-16 11:44:06 -04:00
Geza Lore	fac89c5d62	Close trace on vl_fatal/vl_finish (#2414). This is required to get the last bit of FST trace and close the FST file properly on $stop or assertion failure.	2020-06-12 07:15:42 +01:00
Wilson Snyder	773ed97504	Internals: Most VerilatedLockGuard can be const. No functional change intended.	2020-05-28 18:23:46 -04:00
Wilson Snyder	6a882f9dc6	Internal code coverage improvements. No functional change intended.	2020-05-23 10:34:58 -04:00
Geza Lore	53cde90c8f	De-constify fields to appease old GCC with broken STL. No functional change intended, fixes #2342	2020-05-18 19:01:08 +01:00
Wilson Snyder	ed4c7038b4	Tracing: Remove dead code. No functional change intended.	2020-05-17 09:53:58 -04:00
Wilson Snyder	c4f31d3bb6	Tracing: Remove dead code. No functional change intended.	2020-05-17 09:52:03 -04:00
Geza Lore	900c023bb5	Refactor trace implementation to allow experimentation. The main goal of this patch is to enable splitting the full and incremental tracing functions into multiple functions, which can then be run in parallel at a later stage. It also simplifies further experimentation as all of the interesting trace code construction now happens in V3Trace. No functional change is intended by this patch, but there are some implementation changes in the generated code. Highlights: - Pass symbol table directly to trace callbacks for simplicity. - A new traceRegister function is generated which adds each trace function as an individual callback, which means we can have multiple callbacks for each trace function type. - A new traceCleanup function is generated which clears the activity flags, as the trace callbacks might be implemented as multiple functions. - Re-worked sub-function handling so there is no separate sub-function for each trace activity class. Sub-functions are generated when required by splitting. - traceFull/traceChg are now created in V3Trace rather than V3TraceDecl, this requires carrying the trace value tree in TraceDecl until it reaches V3Trace where the TraceInc nodes are created (previously a TraceInc was also created in V3TraceDecl which carries the value).	2020-05-15 18:34:29 +01:00
Geza Lore	aa9cde22c8	Use SIMD intrinsics to render VCD traces (#2289). Use SIMD intrinsics to render VCD traces. I have measured 10-40% single threaded performance increase with VCD tracing on SweRV EH1 and lowRISC Ibex using SSE2 intrinsics to render the trace. Also helps a tiny bit with FST, but now almost all of the FST overhead is in the FST library. I have reworked the tracing routines to use more precisely sized arguments. The nice thing about this is that the performance without the intrinsics is pretty much the same as it was before, as we do at most 2x as much work as necessary, but in exchange there are no data dependent branches at all.	2020-04-30 00:09:09 +01:00
Geza Lore	dd967f7769	Improve trace buffer memory utilization and performance. Convert trace buffer to 32-bit entries, rather than a union containing a pointer type. Also tweaked trace entry layouts for a bit more performance. This gains another 10% on SweRV EH1 CoreMark.	2020-04-27 19:00:17 +01:00
Geza Lore	b79ef672e1	Various minor optimizations of VCD trace routines - Change templated trace routines to branch table. Removed templating from trace chgBus and fullBus and replaced them with a branch table like the others; there is a very small (< 1%) penalty for this on SweRV EH1 CoreMark, but this is less than the variability of disk IO so it's worth it to keep the code simpler and smaller. - Prefetch VCD suffix buffer at the top of emit* - Increase ILP in VCD emit* routines - Use a 64-bit unaligned store to emit the VCD suffix (on x86 only) The performance difference with these is very small, but the changes hopefully make this code more performance-portable across various micro-architectures.	2020-04-27 18:44:53 +01:00

52 Commits