verilator

mirror of https://github.com/verilator/verilator.git synced 2025-01-01 12:17:35 +00:00

Author	SHA1	Message	Date
Wilson Snyder	91dd3c5fac	Support 1800-2023 keywords.	2024-03-02 10:15:19 -05:00
Geza Lore	f56f318217	Make installation relocatable, and the installation testable (#4927 ) Fixes #4893	2024-03-01 00:08:28 +00:00
Yutetsu TAKATSUKASA	51ae8e13fb	Add --assert-case option (#4919 )	2024-02-23 23:05:53 +09:00
Yutetsu TAKATSUKASA	a951446f9b	Add --[no]-stop-fail option (#4904 ) Co-authored-by: Wilson Snyder <wsnyder@wsnyder.org>	2024-02-19 21:26:58 +09:00
Szymon Gizler	d667b73e8d	Add --json-only and related JSON dumping (#4715 ) (#4831 ).	2024-02-09 17:50:09 -05:00
Szymon Gizler	c715cfc254	Add --valgrind switch (#4828 )	2024-01-29 07:50:05 -05:00
Wilson Snyder	d4c8a15407	Add --runtime-debug for Verilated executable runtime debugging.	2024-01-25 07:34:30 -05:00
Wilson Snyder	354a534d68	Add '--decorations node' for inserting debug comments into emitted code.	2024-01-24 21:51:47 -05:00
Wilson Snyder	c8a40e0b52	Commentary; Tests: Check summary options match guide options	2024-01-24 19:28:46 -05:00
Wilson Snyder	e76f29e5ba	Copyright year update	2024-01-01 03:19:59 -05:00
Geza Lore	4c0edd2efb	Improve --prof-exec infrastructure and report Again --prof-exec have bit-rotted a little with all the recent changes to the structure of the generated code. This patch contains a few improvements: - Repalce the eval/evl_loop begin/end events with generic section_push/section_pop events, that can be arbitrarily sprinkled into the generate code (so long as they are matched correctly) to measure various sections. The report then contains a nested profile of the sections, and the VCD trace shows the section names. - Better handling of exec graphs - Clearer overall statistics	2023-10-21 21:09:03 +01:00
Frans Skarman	e9cc2786b7	Add --no-trace-top option (#4422 )	2023-08-19 04:51:29 -04:00
Don Williamson	df2746de71	Add --main-top-name option for C main TOP name (#4235 ) (#4249 )	2023-05-31 09:02:26 -07:00
Andrew Nolte	13c9877099	Add --public-params flag (#3990 )	2023-03-08 19:38:26 -05:00
Wilson Snyder	b778784333	Add --annotate-points option, change multipoint on line reporting (#3876 ).	2023-02-08 20:22:54 -05:00
Wilson Snyder	807e5b22a0	Fix verilator_difftree header flush	2023-02-05 15:20:56 -05:00
Aleksander Kiryk	31130c4b4a	Fix std:: to be parsed first (#3864 ) (#3928 )	2023-02-03 09:04:16 -05:00
Kamil Rakoczy	6ea725f479	Add --verilate-jobs option (#3889 ) Currently this option isn't used, but in the future it will be used to specify parallelization of Verilation step.	2023-01-22 21:52:52 -05:00
Wilson Snyder	bc7048e8d1	Convert verilator_includer to python3	2023-01-21 14:40:22 -05:00
Wilson Snyder	3fe81a3832	Add manpages for missing user commands (using help2man)	2023-01-17 19:26:12 -05:00
Wilson Snyder	b24d7c83d3	Copyright year update	2023-01-01 10:18:39 -05:00
Wilson Snyder	c0499da28b	Spelling fixes	2022-12-23 11:32:38 -05:00
Aleksander Kiryk	c2b09e35f8	Support unpacked structs (#3802 )	2022-12-20 19:22:42 -05:00
Wilson Snyder	972a11537c	Internals: Fix lint-py warnings	2022-12-11 21:58:02 -05:00
Wilson Snyder	b6cdae30f6	docs: Fix grammar.	2022-12-10 20:09:47 -05:00
Mariusz Glebocki	d0e7177d8e	Disable stack size limit (#3706 ) (#3751 )	2022-11-19 14:44:54 -05:00
Mariusz Glebocki	dc28e7f3e2	Commentary. Remove repeated entry in Verilator help. (#3754 )	2022-11-16 05:32:32 -08:00
Wilson Snyder	7e1b92fa75	Add --get-supported to determine what features are in Verilator (#3688 ).	2022-10-20 21:42:30 -04:00
Wilson Snyder	b92173bf3d	Add --binary option as alias of --main --exe --build (#3625 ).	2022-09-28 09:04:33 -04:00
Geza Lore	47bce4157d	Introduce DFG based combinational logic optimizer (#3527 ) Added a new data-flow graph (DFG) based combinational logic optimizer. The capabilities of this covers a combination of V3Const and V3Gate, but is also more capable of transforming combinational logic into simplified forms and more. This entail adding a new internal representation, `DfgGraph`, and appropriate `astToDfg` and `dfgToAst` conversion functions. The graph represents some of the combinational equations (~continuous assignments) in a module, and for the duration of the DFG passes, it takes over the role of AstModule. A bulk of the Dfg vertices represent expressions. These vertex classes, and the corresponding conversions to/from AST are mostly auto-generated by astgen, together with a DfgVVisitor that can be used for dynamic dispatch based on vertex (operation) types. The resulting combinational logic graph (a `DfgGraph`) is then optimized in various ways. Currently we perform common sub-expression elimination, variable inlining, and some specific peephole optimizations, but there is scope for more optimizations in the future using the same representation. The optimizer is run directly before and after inlining. The pre inline pass can operate on smaller graphs and hence converges faster, but still has a chance of substantially reducing the size of the logic on some designs, making inlining both faster and less memory intensive. The post inline pass can then optimize across the inlined module boundaries. No optimization is performed across a module boundary. For debugging purposes, each peephole optimization can be disabled individually via the -fno-dfg-peepnole-<OPT> option, where <OPT> is one of the optimizations listed in V3DfgPeephole.h, for example -fno-dfg-peephole-remove-not-not. The peephole patterns currently implemented were mostly picked based on the design that inspired this work, and on that design the optimizations yields ~30% single threaded speedup, and ~50% speedup on 4 threads. As you can imagine not having to haul around redundant combinational networks in the rest of the compilation pipeline also helps with memory consumption, and up to 30% peak memory usage of Verilator was observed on the same design. Gains on other arbitrary designs are smaller (and can be improved by analyzing those designs). For example OpenTitan gains between 1-15% speedup depending on build type.	2022-09-23 16:46:22 +01:00
Geza Lore	ddb678cc5b	Merge branch 'master' into develop-v5	2022-09-22 17:33:36 +01:00
Geza Lore	63c694f65f	Streamline dump control options - Rename `--dump-treei` option to `--dumpi-tree`, which itself is now a special case of `--dumpi-<tag>` where tag can be a magic word, or a filename - Control dumping via static `dump*()` functions, analogous to `debug()` - Make dumping independent of the value of `debug()` (so dumping always works even without the debug flag) - Add separate `--dumpi-graph` for dumping V3Graphs, which is again a special case of `--dumpi-<tag>` - Alias `--dump-<tag>` to `--dumpi-<tag> 3` as before	2022-09-22 17:24:41 +01:00
Wilson Snyder	d162619bd3	Merge branch 'master' into develop-v5	2022-09-20 20:06:21 -04:00
Wilson Snyder	fc4ffd454e	Rename --bin to --build-dep-bin.	2022-09-18 10:32:43 -04:00
Wilson Snyder	1234ee5fd2	Merge branch 'master' into develop-v5	2022-09-17 08:02:25 -04:00
Wilson Snyder	80b73859a2	Commentary: Some fixes from 'develop-v5'	2022-09-17 08:00:40 -04:00
Geza Lore	af305bf280	Merge branch 'master' into develop-v5	2022-09-16 16:24:36 +01:00
Wilson Snyder	ab6e1c2399	Commentary on --main	2022-09-15 20:26:08 -04:00
Kamil Rakoczy	da20da264b	Add --build-jobs, and rework arguments for -j (#3623 )	2022-09-15 08:28:58 -04:00
Geza Lore	27031ed688	Merge branch 'master' into develop-v5	2022-09-15 10:28:35 +01:00
Wilson Snyder	75fd71d7e5	Add --main to generate main() C++ (previously was experimental only) (#3265 ).	2022-09-14 20:18:40 -04:00
Wilson Snyder	361cef4633	Fix pylint warnings.	2022-09-07 21:48:52 -04:00
Krzysztof Bieganski	39af5d020e	Timing support (#3363 ) Adds timing support to Verilator. It makes it possible to use delays, event controls within processes (not just at the start), wait statements, and forks. Building a design with those constructs requires a compiler that supports C++20 coroutines (GCC 10, Clang 5). The basic idea is to have processes and tasks with delays/event controls implemented as C++20 coroutines. This allows us to suspend and resume them at any time. There are five main runtime classes responsible for managing suspended coroutines: * `VlCoroutineHandle`, a wrapper over C++20's `std::coroutine_handle` with move semantics and automatic cleanup. * `VlDelayScheduler`, for coroutines suspended by delays. It resumes them at a proper simulation time. * `VlTriggerScheduler`, for coroutines suspended by event controls. It resumes them if its corresponding trigger was set. * `VlForkSync`, used for syncing `fork..join` and `fork..join_any` blocks. * `VlCoroutine`, the return type of all verilated coroutines. It allows for suspending a stack of coroutines (normally, C++ coroutines are stackless). There is a new visitor in `V3Timing.cpp` which: * scales delays according to the timescale, * simplifies intra-assignment timing controls and net delays into regular timing controls and assignments, * simplifies wait statements into loops with event controls, * marks processes and tasks with timing controls in them as suspendable, * creates delay, trigger scheduler, and fork sync variables, * transforms timing controls and fork joins into C++ awaits There are new functions in `V3SchedTiming.cpp` (used by `V3Sched.cpp`) that integrate static scheduling with timing. This involves providing external domains for variables, so that the necessary combinational logic gets triggered after coroutine resumption, as well as statements that need to be injected into the design eval function to perform this resumption at the correct time. There is also a function that transforms forked processes into separate functions. See the comments in `verilated_timing.h`, `verilated_timing.cpp`, `V3Timing.cpp`, and `V3SchedTiming.cpp`, as well as the internals documentation for more details. Signed-off-by: Krzysztof Bieganski <kbieganski@antmicro.com>	2022-08-22 13:26:32 +01:00
Wilson Snyder	b25b798dbe	Merge branch 'master' into develop-v5	2022-07-04 13:20:03 -04:00
Wilson Snyder	fa99cbbc73	Commentary: Fix mis-sorted option names. No functional change.	2022-06-21 19:28:26 -04:00
Wilson Snyder	0f324c8309	Merge branch 'master' into develop-v5	2022-06-04 11:59:49 -04:00
Wilson Snyder	67f7432dd7	Commentary (#3436 ).	2022-06-04 08:37:42 -04:00
Wilson Snyder	ada58465b2	Add -f<optimization> options to replace -O<letter> options (#3436 ).	2022-06-03 20:43:16 -04:00
Wilson Snyder	173f57c636	Changed --no-merge-const-pool to -fno-merge-const-pool (#3436 ).	2022-06-03 19:41:59 -04:00
Geza Lore	b51f887567	Perform VCD tracing in parallel when using --threads (#3449 ) VCD tracing is now parallelized using the same thread pool as the model. We achieve this by breaking the top level trace functions into multiple top level functions (as many as --threads), and after emitting the time stamp to the VCD file on the main thread, we execute the tracing functions in parallel on the same thread pool as the model (which we pass to the trace file during registration), tracing into a secondary per thread buffer. The main thread will then stitch (memcpy) the buffers together into the output file. This makes the `--trace-threads` option redundant with `--trace`, which now only affects `--trace-fst`. FST tracing uses the previous offloading scheme. This obviously helps a lot in VCD tracing performance, and I have seen better than Amdahl speedup, namely I get 3.9x on XiangShan 4T (2.7x on OpenTitan 4T).	2022-05-29 19:08:39 +01:00

1 2 3 4 5 ...

698 Commits