Jeff Normile: Research Notebook #7

jnormile · 2023-09-13T22:37:44Z

jnormile
Sep 13, 2023

This space will primarily be used to track the outcomes of twice-weekly discussions with my first reader--Professor Kapfhammer--regarding the 2023-2024 SE Senior Comp. It may also contain details pertaining to outside research materials not directly discussed in the meetings otherwise outlined in this space as well as links to GitHub repositories with related tooling (or technical blog posts discussing said tools).

jnormile · 2023-09-14T13:19:08Z

jnormile
Sep 14, 2023
Author

To begin, I'll recap an ongoing conversation with my first reader that took place over the course of the past week and a half.

Generating an SE Comp Idea

Monday, Sept. 4

My first meeting with Professor Kapfhammer--who will simply be referred to as GK going forward for the sake of keystrokes--primarily saw me sharing my initial senior comp idea: a browser-based distance teaching platform for imparting programming fundamentals (specifically through the Rust programming language). The intended audience for the platform was adult distance learners, and the main motivation behind the idea was to explore both the impact of gamification (which as a research body is riddled with gaps) and the utility of Rust as an introductory language (despite its reputation for having a steep onboard) while producing a hopefully useful learning platform.

Thinking about what separates a Software Engineering comp from a Computer Science comp was the focus of the conversation, and we ultimately determined that the idea--while it certainly had merit as a Computer Science comp--wasn't particularly focused on the desired outcomes for a Software Engineering comp.

More specifically, GK stated--and I agreed--that a project emblematic of Software Engineering would have broad utility and modularity (i.e., it would be built for re-use). We agreed that an appropriate project would focus more on the software artifact itself (and the engineering process that created) and less on experimental evaluation, which would've been at the forefront of the original proposed idea.

In short, it was back to the drawing board for me.

Thursday, September 7

After spending some time mulling over my predicament--I didn't have a suitable comp idea!--I met with GK to pitch a new idea: an evaluation tool that would be situated at the top of a monorepo containing all assignments (via Git submodules) for a given course and run customized test suites to not only provide grades, but other metrics that might interest an instructor. Somehow, on paper this idea felt like it had novelty, but the more I discussed it with GK and developed the idea, the greater and more obvious the overlap between my proposed tool and existent tools like gatorgrade became.

We affirmed that another important part of a Software Engineering comp was novelty (or as close to novelty as we could get). In order for a software tool to have meaningful utility, it had to do something that wasn't being already being done (or at least not already being done in the Allegheny CIS department).

I was clearly interested in creating an artifact that had some educational utility, and GK suggested that there had to be some unique and novel tooling need that might support a fully developed learning platform (which is more or less what I had presented, albeit in two distinct ways). With his suggestion in mind, I went back to the drawing board. Again.

Tuesday, September 14

In my third meeting with GK I presented my third comp idea, one that seemed like it would have a novel and broadly applicable utility.

Following my prior meeting, I had been thinking about abstract syntax tree parsing (a topic in one of the courses I'm currently TLing) as well as what makes certain code complex (or not complex)--a topic that had been in the back of my mind throughout the duration of a summer internship that saw me puzzle through highly complex and difficult-to-maintain code that hadn't been touched in some years.

I had done some reading on Halstead metrics, McCabe's cyclomatic complexity, and similar efforts to measure code complexity--such as SonarQube's relatively recent cognitive complexity metric. It was this last metric that had piqued my interest, and the way that the measurement defined increasingly nested code as being increasingly complex. Even then, the complexity SonarQube was measuring was--for lack of a better word--"macro"; it only measured distinct branches of logic across many lines of code.

I've never quite been able to wrap my head around lambda expressions in Python. Or list comprehensions for that matter. These often single-line pieces of code are highly complex and--to me anyway (and I assume many introductory programmers)--horribly opaque. But these expressions would have the same level of complexity as a print statement according to SonarQube, Halstead, or whatever other established metric you consulted.

Thinking about AST parsing, I wondered if SonarQube's tactic of counting levels of nestedness could be applied on a more "micro" scale--that is, if we could measure complexity by counting the individual "pieces" of logic that were nested within other pieces of logic, so to speak. If this seems horribly abstract, I'll provide a more concrete example of this in a future post (this one's getting lengthy).

Here's what I propose: we utilize a tool like tree-sitter that can parse ASTs and use it to measure "micro-complexity". This measurement could then be leveraged in an IDE extension to provide syntax highlighting for over-complex code, or possibly to provide some other data-driven utility (such as an instructor plotting the complexity of their assignments over a semester to ensure an even uptick in complexity).

Thankfully, GK agreed that this idea perfectly represents a Software Engineering comp idea.

So now, we start building. (And reading and researching.)

0 replies

jnormile · 2023-09-14T14:47:14Z

jnormile
Sep 14, 2023
Author

This first "research dump" contains a list of relevant articles and research resources pertaining to the topic of Existing Code Complexity Metrics.

Research Dump No. 1: Existing Code Complexity Metrics

Halstead metrics, the "go-to" measurement of code complexity: https://en.wikipedia.org/wiki/Halstead_complexity_measures
Critique of Halstead and established metrics: https://arxiv.org/pdf/2012.12324.pdf (Will need to further investigate)
SonarQube approach to measuring "cognitive complexity" (inspiration for comp idea): https://www.sonarsource.com/docs/CognitiveComplexity.pdf
"Complexity calculator" using Halstead metrics: https://www.proquest.com/docview/2513025707?accountid=8268&pq-origsite=Summon&parentSessionId=AqOV%2B32qAEKQ70VmZZpDs971iyOxkPx6KJnBtRG%2FWXM%3D
"Lines of code" complexity measurement that focuses on nesting: https://www.sciencedirect.com/science/article/pii/S0167642309000379?via%3Dihub
Exploration of Beginning Student Language (BSL)--of interest if framing micro-complexity as having added value for introductory programming courses: https://htdp.org/2023-8-14/Book/i1-2.html
Neuroscience-based approach to evaluating complexity: https://www.frontiersin.org/articles/10.3389/fnins.2022.1065366/full
Software metrics across multiple languages (useful for considering what plexity adds in terms of multi-language support): https://repositorio-aberto.up.pt/bitstream/10216/132618/2/447340.pdf
More metrics across multiple languages: https://www.researchgate.net/publication/295087429_How_to_Calculate_Software_Metrics_for_Multiple_Languages_Using_Open_Source_Parsers

0 replies

jnormile · 2023-09-15T17:57:11Z

jnormile
Sep 15, 2023
Author

Thursday, September 14

My latest meeting with GK saw us spend some time to clearly define expectations related to the SE comp. Those expectations are outlined below, where they will be updated as they continue to be defined over the course of the comp development period.

SE Requirements

I'll be using this section to track agreed-upon criteria for an SE comp that might not otherwise be explicitly stated in the CMPSC comp syllabus.

README-driven development (https://tom.preston-werner.com/2010/08/23/readme-driven-development.html) that leverages the README to not only share basic technical how-to's for artifact, but also detail the software engineering process itself. This means the README should contain:
- User stories / requirement gathering documentation
- Greater detail pertaining to supporting tooling (why those tools/libraries were selected, how they support the artifact)
- Images & text describing the overall design of artifact implementation--essentially an artifact "blueprint"
Robust CI/CD workflow via GitHub Actions:
- Workflow should ensure working build that doesn't produce an error
- Workflow should incorporate industry-standard linters to ensure code meets certain quality thresholds
- Workflow should run test cases--ideally in a randomized order to ensure that there are no incidental dependencies between test cases

0 replies

jnormile · 2023-09-19T13:23:18Z

jnormile
Sep 19, 2023
Author

Unlike my above "research dump", which is used to collect research articles and academic sources, this "technical reference dump" is more interested in technical writing (like tech blog posts, library documentation, etc.) that I suspect will aid me in putting together the software artifact for my comp.

Technical Reference Dump No. 1: Using `tree-sitter`

Haobo's blog (using tree-sitter in Rust): https://haobogu.github.io/posts/code-intelligence/tree-sitter/
tree-sitter "Getting Started" documentation: https://tree-sitter.github.io/tree-sitter/using-parsers#getting-started
Teknologi Umum blog (outside discussion on tree-sitter use cases): https://teknologiumum.com/posts/introductory-to-treesitter
Rust bindings for tree-sitter: https://docs.rs/tree-sitter/latest/tree_sitter/
bat library for syntax highlighting in Rust: https://github.com/sharkdp/bat
https://derek.stride.host/posts/comprehensive-introduction-to-tree-sitter

0 replies

jnormile · 2023-09-19T21:05:35Z

jnormile
Sep 19, 2023
Author

Tuesday, September 19

My fifth meeting with GK saw me sharing my progress in implementing the comp, with the most notable piece to share being a feasibility demo of a Rust implementation of tree-sitter parsing a (hyper-simplistic) snippet of JSON code. This has been--in my book anyway--the first concrete step towards delivering something that might someday resemble a successful senior comp!

Our discussion was brief, but we agreed that the next step was implementing file-reading so that the input could for the parser could be more than a hard-coded snippet of code. Notable steps that I'll want to start investigating (and eventually implementing) will include syntax highlighting and automatic language detection.

0 replies

jnormile · 2023-09-22T16:53:21Z

jnormile
Sep 22, 2023
Author

Research Dump No. 2: In Defense of `tree-sitter`

As an SE comp, part of the work I'll have to tackle is justifying and defending the use of major supporting libraries. So far, the big tool that I'm depending on is tree-sitter, an AST parser that supports a very wide spread of programming languages (perhaps its biggest selling point for me as I try to develop a tool that can measure nested logic in as wide a variety of languages as possible). The below sources (primarily academic papers) were found within the main documentation for tree-sitter, and offer a spread of defenses/justifications for the design and implementation of tree-sitter:

Practical Algorithms for Incremental Software Development Environments: https://www2.eecs.berkeley.edu/Pubs/TechRpts/1997/CSD-97-946.pdf
Context Aware Scanning for Parsing Extensible Languages: https://www-users.cse.umn.edu/~evw/pubs/vanwyk07gpce/vanwyk07gpce.pdf
Efficient and Flexible Incremental Parsing: https://harmonia.cs.berkeley.edu/papers/twagner-parsing.pdf
Incremental Analysis of Real Programming Languages: https://harmonia.cs.berkeley.edu/papers/twagner-glr.pdf
Error Detection and Recovery in LR Parsers: https://what-when-how.com/compiler-writing/bottom-up-parsing-compiler-writing-part-13
Error Recovery for LR Parsers: https://apps.dtic.mil/sti/pdfs/ADA043470.pdf

0 replies

jnormile · 2023-09-26T13:52:48Z

jnormile
Sep 26, 2023
Author

Technical Reference Dump No. 2: Language Detection

One of the primary reasons that I'm leveraging tree-sitter for this comp is that it supports an incredibly wide array of programming languages. In order to make use of this wide language support however, my application must similarly support a wide variety of programming languages. One of the requisites for this is going to involve--at some level--some programming language detection. (As a fail-safe, I could also simply write my tool to require a language be manually selected by the user, but this seems like it would limit future applications of the tool, such as syntax highlighting.)

These reference materials highlight documentation I've dug up related to anything pertaining to programming language detection:

GitHub citation on their approach to language detection: https://docs.github.com/en/repositories/managing-your-repositorys-settings-and-features/customizing-your-repository/about-repository-languages
linguist (a Ruby library used by GitHub): https://github.com/github-linguist/linguist
hyperpolyglot (a Rust library inspired by Linguist; not particularly well maintained): https://github.com/monkslc/hyperpolyglot
Further hyperpolyglot documentation: https://crates.io/crates/hyperpolyglot
syntect (a library for syntax highlighting w/ wide language support--may be able to simply use their language detection): https://github.com/trishume/syntect
Further syntect documentation: https://docs.rs/syntect/latest/syntect/parsing/struct.SyntaxSet.html

0 replies

jnormile · 2023-09-26T14:35:38Z

jnormile
Sep 26, 2023
Author

Technical Reference Dump No. 3: GitHub Actions

I'm not particularly savvy when it comes to CI/CD--a key part of the software engineering process, and one that I'll need to tackle while building the SE comp. This technical reference dump simply contains documentation that I may use to cobble together my GitHub Actions flow.

"Getting Started with GitHub Actions for Rust": https://dev.to/rogertorres/getting-started-with-github-actions-for-rust-1o6g

0 replies

jnormile · 2023-09-28T14:36:32Z

jnormile
Sep 28, 2023
Author

Tuesday, September 26

Another meeting with GK; this particular session I highlighted some of my findings regarding programming language detection, and suggested that this particular feature might be beyond the scope of at least the prototype--a sentiment that GK agreed with. Instead, we decided it'd be appropriate to outfit the program with the capability to parse an additional CLI argument, requiring the user to manually select a language to parse an AST for (rather than automatically detecting this based on the input file selected). A little clunky, but it gets the job done and keeps me from being over-focused on a nice perk rather than the main focus of the comp.

We also discussed some additional content to consider adding to my GitHub Actions workflow (which is currently quite bare), and had a brief discussion on selecting an appropriate license for the comp repo. The take-aways from the former discussion have been added into an earlier discussion post about requirements for the SE comp.

0 replies

jnormile · 2023-10-03T13:19:54Z

jnormile
Oct 3, 2023
Author

Friday, September 29

This is more of an aside, and not directly related to the development of the Senior Comp, but this date marked the Alumin Panel and social at Alden. The panel itself was interesting (if Luis isn't involved in a radio show or podcast then his voice is being wasted) and I spent a large chunk of the social talking with alum Noor about his experience the past year at NetApp. We swapped stories about the respective development cultures of the companies we had worked with (myself drawing from my internship experience at Auto-Owners Insurance).

0 replies

jnormile · 2023-10-05T13:34:23Z

jnormile
Oct 5, 2023
Author

Tuesday, October 3

After a lengthy meeting with GK primarily focused on brainstorming how to experimentally evaluate my proposed complexity metric, I was left with several big-picture questions to mull over. Namely, should I lean on IRB-approved human research, or develop an additional software tool that can be utilized to do some evaluation via repository mining? Or (if I'm feeling particularly ambitious) some combination of the two?

I'm still not sure which way to lean, but there is a pressing deadline coming up (Nov 1, the due date for IRB proposals) that I'll have to keep in mind as I consider which route is best for this project.

0 replies

jnormile · 2023-10-12T20:29:16Z

jnormile
Oct 12, 2023
Author

Thursday, October 12

Today's meeting with GK had us defining some of the next steps for the comp. We concretely laid out the "story" or selling points to my comp idea in preparation for upcoming pitch presentations, and discussed engineering to-dos that will bring my artifact to a suitable place for said pitch presentation.

In broad strokes, these to-dos include:

Cleaning up the README/documentation (include setup/running instructions)
Building on existing workflow actions to include basic linting and some semblance of testing (even minimal coverage; will need to do some extra reading on testing conventions for Rust)
Working out bug with current node depth measurement (so that entire node and all its stored data can be utilized for "round trip" parsing, as opposed to current working s-expression implementation)
Look at calculating other existing complexity measurements with my tool for easy comparison with my proposed view on complexity: Halstead, McCabe, and raw lines of code (LOC)

0 replies

jnormile · 2023-10-26T22:59:45Z

jnormile
Oct 26, 2023
Author

Thursday, October 26

Today's meeting with GK had us briefly reviewing content for my pitch presentation tomorrow, and start in on brainstorming the next action item to tackle: measuring cyclomatic complexity. I proposed using the existing AST traversal to grab nodes that might have conditional logic embedded in them, and cross-referencing existing documentation on measuring cyclomatic complexity to determine what types of nodes increment complexity.

We agree that implementing this measurement across more than one programming language will likely be a bit of a bear to implement, and after discussing a plan of action today, we devoted our next meeting (on 10/31) to spending the time working together to start to identify cyclomatic complexity incrementing nodes in one language (e.g., Python), and starting the process of comparing measurements (and node identifiers) across other language grammars.

0 replies

jnormile · 2023-11-01T22:19:35Z

jnormile
Nov 1, 2023
Author

Tuesday, October 31

In my meeting with GK today, we spent some time talking about what the written comp chapters for an SE comp should consider and tackle (as compared to what's laid out for the CS comp chapters); specifically, I'll want to keep in mind actual software artifacts (ideally both open source and enterprise-level) that I can draw on for my related work section. We also discussed some long-term ambitions for the plexity project, which were particularly exciting to think about!

(We admittedly got a little sidetracked during this meeting and spent a good chunk of time talking about a side project--an evaluation tool related to measuring data pertaining to learning objectives in the CIS department.)

0 replies

jnormile · 2023-11-14T14:24:50Z

jnormile
Nov 14, 2023
Author

Tuesday, November 7

I realized after the fact that I forgot to talk about my last session with GK, so here's a (belated) entry summarizing our time together. As has recently been the trend, a fair chunk of our time was spent excitedly (and distractedly) hashing out thoughts and ideas for the department program outcomes evaluation tool. (An important conversation, but one that's not remotely related to the comp itself.)

For the time that we did spend discussing the comp, I demonstrated plexity's latest new capability: measuring cyclomatic complexity for Python programs! We talked about the hurdles of translating this to other languages (which there are many), and for now he pointed me to some documentation for ast-grep that does a nice job articulating how someone might use their foundation and apply it to other contexts--a strategy that I might consider for my own tool (and outfitting it with other language capabilities for cyclomatic complexity).

0 replies

jnormile · 2023-11-21T15:36:16Z

jnormile
Nov 21, 2023
Author

Tuesday, November 14

Another slightly belated recap. My time with GK was spent demonstrating some of the SE tooling I added to plexity--i.e., unit testing, which was more painful to implement in Rust than I would have otherwise predicted--and getting his thoughts on unit testing strategies (namely, how much of the increasingly growing codebase to cover with test cases).

Additionally, we spent time aspirationally thinking about my work on the comp as well as my contributions to GK's 203 course. He directed me to the PyCon 2024 website and asked me to spend some time thinking about whether I'd be interested in helping with poster efforts for chasten, a tool being developed in his 203.

0 replies

jnormile · 2023-12-01T18:32:15Z

jnormile
Dec 1, 2023
Author

Thursday, November 30

Since my last meeting with GK, I had started in on the actual writing of the comp. While putting pen to paper, I realized that I had (at best) a fuzzy and not particularly compelling idea of the ethical implications of my project--somewhat problematic since it's meant to play a relatively large role in the chapter content. My time with GK this week was spent bouncing ideas off of each other to bridge this gap, and I walked away with several ideas for ways to further flesh out the ethical implications of complexity metrics.

0 replies

jnormile · 2023-12-08T16:44:50Z

jnormile
Dec 8, 2023
Author

Ethics Discussion Dumping Ground

This is simply a place for me to keep my scattered thoughts on productive discussion of the ethical implications of my project:

Scientific Taylorism
Missing the forest for the trees
Missing out on "soft metrics"--psychological safety, etc.
Look at blog posts about assigning too much importance to commits
Look at the story of Google--focusing on Taylorism/KPIs and pivoted from it
Construction industry stagnation -- no novel metrics

0 replies

gkapfham · 2024-01-01T17:01:36Z

gkapfham
Jan 1, 2024
Maintainer

Hello @jnormile, thanks for your efforts in documenting your research through this notebook. Overall, the entries that you prepared are very good. Thanks again! I suggest that you could improve some of your entries if you made the TODO lists actual lists of TODO entries and then made it more clear when you had actually completed the task. Also, I encourage you to include a few more details about the work that you completed at the very end of a semester. Okay, I look forward to working with you during the upcoming semester!

0 replies

jnormile · 2024-01-26T18:31:23Z

jnormile
Jan 26, 2024
Author

Wednesday, January 24

Today kicked off my meetings with GK for the spring semester. Today we went over the feedback from my chapters GK had provided over the break. Much of the feedback had already been addressed, and after some discussion GK pivoted based on my input; the outstanding to-dos are primarily:

Pepper in references to code coverage (such as in cyclomatic complexity discussion, or conversation related to nesting)
Fix outstanding issues with references (typos, incorrect capitalization, etc.)
Consider adding a single graphic that captures the key differences between the complexity approaches being discussed (and appropriately motivates the existence of my tool)

0 replies

jnormile · 2024-01-29T17:15:08Z

jnormile
Jan 29, 2024
Author

Monday, January 29

My meeting with GK focused on to-do's pertaining to the actual software tool itself. The main take-aways/priorities for me to focus on consisted of:

Consider/start building additional tooling for data analysis/collection--what does that look like, how does it work?
Find outside tool for computing cyclomatic complexity (for eliminating threats to validity)
Continue to work on filling out the test suite

0 replies

jnormile · 2024-02-11T18:02:43Z

jnormile
Feb 11, 2024
Author

Friday, February 9

Today GK and I briefly discussed the content I had prepared for my draft of Chapter 3, and also talked about some of my thought processes regarding upcoming data collection. Outstanding TODO's include:

Potentially look into an educational license for SonarQube (to avoid manually computing cognitive complexity); though upon further discussion/reflection, manual computation might be more appropriate
Complete remaining content for Chapter 3 draft, using content from original pitch presentation as basis for remaining discussion

0 replies

jnormile · 2024-02-12T17:11:39Z

jnormile
Feb 12, 2024
Author

Monday, February 12

Not much new to report since my last meeting had gotten bumped up. This was a brief session with GK, with the conversation focusing on what the subjects for experimentation might look like (in terms of programming language). The main question is this: do I focus on subjects in one language (i.e., Python), or do I consider subjects across a wide variety of languages?

My gut tells me to go with the latter--the benefits of this approach quickly and readily emerge: I can highlight the multi-language support of the tool, showcase novel cases for considering complexity where it hasn't been previously considered (e.g., YAMLs, Dockerfiles), and even examine the complexity of entire projects (core src files as well as supporting infrastructure files like YAMLs or JSONs).

It'd also be worth considering doing this in two "stages"--one stage that is focused on a single language for comparison with other complexity metrics, and another that considers novel uses of plexity.

I don't have any additional TODOs to report, apart from thinking about the above question and arriving at some consensus before I start in on experimental evaulation of plexity and writing content for Chapter 4.

0 replies

jnormile · 2024-02-16T19:25:24Z

jnormile
Feb 16, 2024
Author

Friday, February 16

Today, during the 610 afternoon classroom session, I partnered with Danny from the SE exemplar group and watched him demonstrate some of the tooling he's working on developing, as well as shared my own comp artifact. (Turns out there's some striking similarities between the two!)

I offered some insights for a roadblock he was running into, and jotted them down in the GitHub issue linked here.

0 replies

jnormile · 2024-02-21T20:35:11Z

jnormile
Feb 21, 2024
Author

Wednesday, February 21

Today in my weekly meeting with GK I bounced some additional thoughts from our previous conversation--what does my experimental results section look like?

We've landed on the following agreed-upon plan:

Select a few files that can draw comparisons between plexity and other established metrics (Halstead, cyclomatic, cognitive) to talk about some basic differences between the metrics--this will largely just be an extension of the claims/arguments made in prior chapters
For the bulk of the chapter, sticking with the same repositories that the already examined files come from, apply plexity to other files that other complexity metrics can't weigh in on: anything from .toml, .yml, .gitignore, .xml, .json, etc.; take the time to articulate how each of the different items on the plexity scorecard can be used to articulate assertions on complexity (# of nodes as a measure of scale, deepest node found as an indicator of potential problem spots, average node depth as an at-a-glance assertion about complexity); also take time to re-explain why the plexity approach is to provide a spread of different aspects, as opposed to a single value (like cyclomatic/cognitive)
Bring it home with a discussion on repository proportions. What proportion of the repos examined are comprised of files that can be examined by other complexity metrics? What proportion are comprised of files that only plexity can articulate about?

0 replies

jnormile · 2024-03-01T18:40:32Z

jnormile
Mar 1, 2024
Author

Future Research Ideas!

Friday, March 1

Today's meeting with GK consisted of me articulating progress with the experimental evaluation (parsing through vscode using my tool). Additionally, we spent some time concretely defining steps for future research for chapter 5, which are detailed below!

Shortcomings for future research:

Is there a way for the tool to reason about correlations between complex files and build failures? (Obviously, no.)
- Potentially do something anecdotal by just clicking around GitHub!
Dearth of grammars with working Rust bindings
Inability to automatically detect filetype
By extension, the above prevents the quick analysis of more than one file at a time
Steps required to add other existing complexity metrics to the plexity scorecard
Conducting a human study to affirm that intuitions about complexity match up with actual developer perception
Investigating the weighting of control flow structures (borrowing from the definition of complexity offered by cognitive/cyclomatic complexity)

0 replies

jnormile · 2024-03-15T17:38:29Z

jnormile
Mar 15, 2024
Author

Monday, March 11

I met briefly with GK--the focus of this particular conversation was catching him up on what I've done/what still has to be done.

The list of outstanding TODO's is as follows:

Complete processing vscode files using plexity (a little less than 2k left)
Using data from processed files, finish content for Chapter 4
Write a rough draft of Chapter 5 using the many ideas surfaced in this research notebook
Polish the artifact! Double check that there are passing builds, documentation is squared away, etc.

0 replies

jnormile · 2024-03-25T16:17:30Z

jnormile
Mar 25, 2024
Author

Monday, March 25

Today's meeting was focused on discussing questions/concerns around the thesis defense presentation. Some of the key insights GK presented to me were:

Clearly signposting the fact that both qualitative and quantitative analyses were performed
Also clearly signposting the scale/size of vscode--it's not a sample of 1, it's a sample of 7,117!
VERY briefly foreground background/motivation--no more than a minute or two
Be sure to include ethical implications in some capacity
Speak broadly about the areas of future work--generalized roadmap, maybe? Pitch this to 580 students!
For demo, talk through metrics for very simplistic example; then just show that it works for a file of much greater size, scope (performant)

0 replies

gkapfham · 2024-05-06T15:56:32Z

gkapfham
May 6, 2024
Maintainer

Hello @jnormile, you have submitted a very good research notebook for the Spring 2024 semester. Thanks!

0 replies

Ready Set Research -> 2023 - 2024

Jeff Normile: Research Notebook #7

jnormile Sep 13, 2023

Replies: 29 comments

jnormile Sep 14, 2023 Author

Generating an SE Comp Idea

Monday, Sept. 4

Thursday, September 7

Tuesday, September 14

jnormile Sep 14, 2023 Author

Research Dump No. 1: Existing Code Complexity Metrics

jnormile Sep 15, 2023 Author

Thursday, September 14

SE Requirements

jnormile Sep 19, 2023 Author

Technical Reference Dump No. 1: Using tree-sitter

jnormile Sep 19, 2023 Author

Tuesday, September 19

jnormile Sep 22, 2023 Author

Research Dump No. 2: In Defense of tree-sitter

jnormile Sep 26, 2023 Author

Technical Reference Dump No. 2: Language Detection

jnormile Sep 26, 2023 Author

Technical Reference Dump No. 3: GitHub Actions

jnormile Sep 28, 2023 Author

Tuesday, September 26

jnormile Oct 3, 2023 Author

Friday, September 29

jnormile Oct 5, 2023 Author

Tuesday, October 3

jnormile Oct 12, 2023 Author

Thursday, October 12

jnormile Oct 26, 2023 Author

Thursday, October 26

jnormile Nov 1, 2023 Author

Tuesday, October 31

jnormile Nov 14, 2023 Author

Tuesday, November 7

jnormile Nov 21, 2023 Author

Tuesday, November 14

jnormile Dec 1, 2023 Author

Thursday, November 30

jnormile Dec 8, 2023 Author

Ethics Discussion Dumping Ground

gkapfham Jan 1, 2024 Maintainer

jnormile Jan 26, 2024 Author

Wednesday, January 24

jnormile Jan 29, 2024 Author

Monday, January 29

jnormile Feb 11, 2024 Author

Friday, February 9

jnormile Feb 12, 2024 Author

Monday, February 12

jnormile Feb 16, 2024 Author

Friday, February 16

jnormile Feb 21, 2024 Author

Wednesday, February 21

jnormile Mar 1, 2024 Author

Future Research Ideas!

Friday, March 1

jnormile Mar 15, 2024 Author

Monday, March 11

jnormile Mar 25, 2024 Author

Monday, March 25

gkapfham May 6, 2024 Maintainer

jnormile
Sep 13, 2023

jnormile
Sep 14, 2023
Author

jnormile
Sep 14, 2023
Author

jnormile
Sep 15, 2023
Author

jnormile
Sep 19, 2023
Author

Technical Reference Dump No. 1: Using `tree-sitter`

jnormile
Sep 19, 2023
Author

jnormile
Sep 22, 2023
Author

Research Dump No. 2: In Defense of `tree-sitter`

jnormile
Sep 26, 2023
Author

jnormile
Sep 26, 2023
Author

jnormile
Sep 28, 2023
Author

jnormile
Oct 3, 2023
Author

jnormile
Oct 5, 2023
Author

jnormile
Oct 12, 2023
Author

jnormile
Oct 26, 2023
Author

jnormile
Nov 1, 2023
Author

jnormile
Nov 14, 2023
Author

jnormile
Nov 21, 2023
Author

jnormile
Dec 1, 2023
Author

jnormile
Dec 8, 2023
Author

gkapfham
Jan 1, 2024
Maintainer

jnormile
Jan 26, 2024
Author

jnormile
Jan 29, 2024
Author

jnormile
Feb 11, 2024
Author

jnormile
Feb 12, 2024
Author

jnormile
Feb 16, 2024
Author

jnormile
Feb 21, 2024
Author

jnormile
Mar 1, 2024
Author

jnormile
Mar 15, 2024
Author

jnormile
Mar 25, 2024
Author

gkapfham
May 6, 2024
Maintainer